Jay E | RoboNuggets · Youtube · 11:22

Opus 4.8 is NOT Claude's biggest release today

An 11-minute screen-share demo arguing that Dynamic Workflows and UltraCode are the real Opus 4.8 story, and showing what 96 parallel sub-agents actually look like.

Posted
May 28th 2026
yesterday
Duration
11:22
Format
Tutorial
educational
Channel
JE
Jay E | RoboNuggets
§ 01 · The Hook

The bait, then the rug-pull.

The title buries the thesis before the video even starts: the model is not the story. A solo demo creator opens on Anthropic's benchmark table, waves it off as directional noise, then scrolls to the section most viewers would never reach.

§ · Chapters

Where the time goes.

00:00 – 00:51

01 · Hook + Benchmark Dismissal

Opens on the Anthropic announcement page; dismisses benchmarks as directional-only; argues harness updates are where the real power lives.

00:51 – 01:49

02 · Dynamic Workflows Explained

Walks through the Anthropic blog post explaining orchestrator and sub-agent architecture, then introduces the UltraCode effort setting.

01:49 – 02:40

03 · Demo Setup: Ecommerce Audit

Shows the VS Code effort toggle turning purple and the terminal rainbow animation for /workflows. Sets up a three-site DTC brand audit prompt.

02:40 – 05:29

04 · Demo 1: 13-Agent Audit Running

Fires the dynamic workflow, watches it fan out 9 audit agents plus synthesis, checks /workflows for phase status, waits for deliverables.

05:29 – 06:45

05 · Results + Design Iteration

Reviews the Gymshark (AU) report scored 88/100 on technical SEO, notes vanilla white-paper design, applies a design system to reformat.

06:45 – 08:27

06 · UltraCode Setup + Bug Audit

Switches effort to UltraCode via /effort menu, sets up a bug audit of the creator's personal Rubric dashboard app.

08:27 – 09:25

07 · 96-Agent Verification Step

UltraCode does a pre-assessment, then fans out 8 auditors; verification layer adds 88 more agents; /workflows shows all phases live.

09:25 – 10:11

08 · Bug Audit Results

HTML report delivered: 1 critical, 15 high, 30 medium, 24 low, totaling 70 unique bugs each independently verified.

10:11 – 11:22

09 · Token Cost and Rate Limit Critique

Account usage jumped from 2% to 6% weekly. Argues Anthropic should display absolute token counts instead of percentages.

§ · Storyboard

Visual structure at a glance.

Anthropic announcement page
Dynamic Workflows blog post
VS Code effort toggle purple
Ecommerce audit prompt in terminal
/workflows phase view
Gymshark scored PDF report
UltraCode effort slider
88 verification agents running
Bug report 70 unique bugs
Account usage 6% weekly
§ · Frameworks

Named ideas worth stealing.

00:51 model

Dynamic Workflow Architecture

  1. Orchestrator agent (Opus 4.8)
  2. Parallel sub-agents per phase
  3. Synthesis agent
  4. /workflows monitoring layer

A four-layer structure where one orchestrator plans phases, parallel agents do the work, a synthesis agent consolidates, and /workflows gives real-time visibility.

Steal for Any complex research, audit, or code review task where correctness across many files matters more than token cost.
06:45 list

Effort Ladder

  1. Low
  2. Medium
  3. High
  4. xHigh
  5. Max
  6. UltraCode

Six effort tiers in Claude Code; UltraCode adds model-level autonomy to trigger dynamic workflows without explicit user instruction.

Steal for Calibrating session cost vs. correctness for different task types.
05:29 concept

Two-Pass Delivery

Run the research or audit agent first for technically correct output, then prompt with a design system to make it client-ready.

Steal for Any client-facing deliverable generated by AI.
§ · Quotables

Lines you could clip.

00:00
"While most people are focusing on these trust me, bro, benchmarks, I think the real unlock are the major improvements that they made to their coding harness."
Contrarian opener with a memorable phrase; no setup needed. → TikTok hook
04:05
"It costed us 4% of our weekly rate limit. So that tells you, number one, how token intensive these modes are, so be warned."
Concrete warning with a real number that reframes the feature as a tradeoff. → IG reel cold open
10:11
"I would much rather them have an absolute number of tokens in here so that we can really measure when they're saying that they're increasing rate limits or not."
Specific product critique that resonates with power users; standalone clip. → newsletter pull-quote
§ · Resources Mentioned

Things they pointed at.

02:40productRoboNuggets community
06:47productRubric personal dashboard app
§ · CTA Breakdown

How they asked for the click.

11:11 subscribe
"If that's useful, then consider subscribing because that helps us a lot to put out more educational content like this."

Single closing line after all main content, no pre-CTA pitch, low pressure.

§ 04 · The Script

Word for word.

HOOK opening / re-engagementCTA the pitch metaphor
00:00HOOKSo Cloud Opus four dot eight is here. And while most people are focusing on these trust me, bro, benchmarks, I think the real unlock are the major improvements that they made to their coding hardness Cloud Code. So in this video, I'll give you a rundown of this ultra code mode and how to use it and also their new dynamic workflows feature that is Anthropic's answer to long running agents.
00:16HOOKLet's dive into it.
00:21So OPUS four dot eight is now out. And as usual, they published the benchmarks in here, which you can just browse on your own time. But at least after having worked with these models quite a lot already, I think these benchmarks are good to show incremental improvements on where these new models and releases are directionally strong at.
00:36But in my experience, a lot of the power of these models now don't actually directly come from the models, but mostly come from the updates to the harness itself. So if you scroll down to their update in here, they sort of buried the lead in here to put this dynamic workflows feature in this also launching today section.
00:53And if you go through their document, dynamic workflows in Cloud Code, basically, what it does is it helps Cloud take on the most challenging task end to end. So what that means is, let's say, have a problem that is too big for one pass by a single agent and you actually want multiple concurrent agents or sub agents to accomplish that task, you can use dynamic workflows in order to have one orchestrator agent, which is gonna be OPUS four dot eight, and have it orchestrate this multiphase
01:18plan to accomplish that one big task that you assign to it. And apart from dynamic workflows, another thing that they released that they, again, sort of buried the lead in is this new Cloud Code specific setting called Ultra Code. And what it does is set the effort level to extra high while also letting Cloud decide automatically
01:36when to use a dynamic workflow to handle your task. So what does that look like in practice, and how can we invoke these? Well, let's just go into a demo and try it out ourselves.
01:44So to use dynamic workflows and UltraCode, you can actually use the Versus Code extension. So if I go to the effort toggle in here once you update your Cloud Code, you can now see that there is this ultra code option which turns it to purple. And, of course, you can also access it via the terminal view.
01:59And I'll just demo it here because at least in the terminal view, it seems like the Anthropic team deemed this release so important that they even assigned, like, a custom sort of a rainbow color whenever you type in workflows into your prompt in here. And then similarly, if you change the effort to ultra code, that also has that nice animation
02:17that they even coded just for that release. For our test prompt, what I'm going to do is give it a sufficiently complex task where I'm asking it to use dynamic workflows to audit three ecommerce websites for a direct to consumer growth agency in Sydney. So it's going to be a brand audit of these three websites.
02:35CTAAnd for each of these domains, we need a technical SEO scorecard, a content and keyword gap analysis, conversion and user experience flags, three quick wins, three mid effort wins, and basically a brand audit that would have taken a mid class agency a couple of days to put together in the past.
02:51So we'll fire that off using dynamic workflows, and we'll actually see what Cloud Code is going to do with this complex task. And just to show you how much tokens this will consume, I'm currently on the max plan for this account. And right now, my weekly rate limits is consumed at the 2% mark.
03:04So we'll see by the end of this test how much percentage tokens that will consume. And by the way, if you're interested in going from just using AI to getting paid for it, then check out the Robo Nuggets community down in the description. We've got founders in there who landed their first client in weeks, live build sessions where we create this stuff together, and the actual templates behind what I just showed in this video.
03:22The community is also the reason these lessons get made, so see that below if that's for you. Okay. So now that it is running, you can see that what it did here is it's starting to fan out nine audit agents in order to do this task for us.
03:34Now what's good about it is that it actually recognize that dynamic workflows is going to be token intensive. So if you are saving up on tokens, this is probably not something that you would want to just do randomly. But at least for the sake of this demo, let's just go ahead and run it.
03:49Now it's saying that the workflow is running in the background with 13 live fetch agents. And interestingly, you can see what it's doing here that it's acting as sort of the manager or the orchestrator of this whole task. So it's saying that it's using the weight productively and pre building the report generator so that the moment the data lands,
04:05it can turn it into the deliverables fast. So that is what I'm talking about with regard to the harness. Yes.
04:11OPUS 4 Dot 7 to OPUS 4 Dot 8 is a nice job, and they'll always show good benchmarks whenever these new models release. But this sort of user experience and the way these agents are architected really matter a lot more versus the benchmarks that you usually see just the front loaded in a lot of these YouTube videos.
04:28Alright. So this has been running for around five minutes now. One thing you can do actually, because it's saying here that 12 out of 13 agents are done, is you can type in slash workflows.
04:38And what that now shows is a proper plan that your Orchestrator agent has drafted up around this audit. So you can see for phase one, which is the audit itself, those nine agents are already done, and it also shows the amount of tokens that they consumed if you're particular about that. There is a phase two around planning, which I assume is basically planning out its output.
05:00And then now we have this synthesis agent in order to put all of those learnings together. So if it's been running for a while and you need a view of how it's going, then this is one way for you to monitor or observe the progress of your long running task. Alright.
05:14So now it's done, and it gave us a couple of deliverables. It gave us three brand reports for each of those websites. It gave us a comparison sheet as well as the summary document.
05:23So if you look at the executive summary, you can see it has the ranking of those different ecommerce website. It has the top three takeaways. So it highlighted here, let's say, the biggest SEO upside, so definitely agree that that should be top of line for the headlines.
05:37And if you look at the individual PDF reports, you can see a more summarized version per brand around their technical SEO capabilities, content and keywords, and conversion and UX with a lot more detail down the line. Now something that I think it should have done or maybe I should have included in the prompt is a proper design look of this report.
05:54Because right now, it's very vanilla white paper. No one's really gonna read this type of design. Right?
05:58So what you can do here is to just ask Cloud Code to iterate on this. And at least from my side, I have this robo group design system to update the executive summary and those PDFs so that it is a bit more beautifully designed versus this white paper report that is technically rich but is probably not as enticing to read.
06:16Once that's done, it's now been properly formatted and is just much nicer to read. And from here, you can just tweak all the details that it got, turn them into slides, ask Cloud Code to revise the wording of it as you would usually do. But at least a lot of the hard work and a lot of the research behind this report has already been done by your multiple agents, and it only took, like, five minutes.
06:36K. Now let's do another test. And for this one, let's try out the UltraCode effort.
06:41So if you just type in effort, you can change the effort level here to the UltraCode smarter level. And just to simplify this, if in case you haven't used or tweaked effort before,
06:50effort basically pertains to how many times a model thinks about its output before serving it to you. And for UltraCode, can see the sub headline here. That's essentially extra high, plus it gives the model the ability to decide if it wants to use dynamic workflows or not.
07:06So let's just select that. So now we're in UltraCode. And for this task, I'm giving it this prompt where I'm asking it to audit the Rubrik app.
07:13And in case you're new, the Rubrik app is my own personal command center, which I also share with my community. And, basically, whenever I create, like, micro apps that help me with our work, I just put them here to centralize all of those applications in one view. And so what we'll ask OPUS four dot eight to do is audit that whole vibe coded application
07:31and search thoroughly for bugs. So I'm going to ask for a ranked bug list, and I want a stand alone report that will just provide me a nice view of what the bugs are so that we can fix it later.
07:43So I'll fire that off, and I just made it a bit more open ended versus the other tests that we did. Because I think with UltraCode, what should now happen is that if it deems this task to be large enough, which I think it is, it should go ahead and use dynamic workflows on its own accord and actually fan out those sub agents to hit on this task.
08:00Alright. So it's starting that task now. I just like to point out.
08:03So you can see here that when you're on the ultra code effort method, what it's doing here is that right now, it is doing the work as a sole agent, but it has that intelligence to do some initial analysis first and an initial discovery before doing or orchestrating the deep audit.
08:19So you can see here it recognized that UltraCode is on. So it'll orchestrate a fan out audit with adversarial per finding verification. So lots of big words for Opus four dot eight.
08:29But basically, what that means is that initially, I'll do a pre assessment first before doing a proper fan out audit. And if it seems like it's a big task, then I'll decide on my own if I need to spawn some sub agents, some interns to help me out with this task. And now here you go.
08:44It's now doing the audit across eight parallel auditors. So we can actually type in slash workflows now, and what it's now showing us is the status of that task. So we have the rubric bug audit.
08:55We have these eight agents that are all doing the work, all OPUS four dot eight, and you can see here the status of each in terms of that usage. So I just came back to this, and I just wanted to show before I continue this. When it did its verification step, what it did is spawn 88 parallel sub agents in order to verify its findings in here.
09:14So I guess that claim out there is that you can spawn hundreds of parallel sub agents in order to do this long running task is apparently true. So it can happen. And with just this bug report audit, it was able to spawn
09:2496 total sub agents in here. Alright.
09:27So the UltraCode run is now done, and it gave us this HTML page that gave us a view of some of the bugs across my personal dashboard, which is not surprising because at least for my version of this dashboard, it is just for personal use. You can see how powerful this is. Right?
09:40If you have an application or a dashboard that you are serving to clients, you can just use UltraCode and dynamic workflows in order to spawn several sub agents, 96 sub agents, at least for this case, to find critical high and medium bugs, which now from here, you can either read through it or just have Claude code address the top ones.
10:00Alright. And one last thing, if we check back on our account and usage, you can see our weekly rate limits jump from two to 6%. So those two tasks, they're very heavy tasks.
10:10It costed us 4% of our weekly rate limit. So that tells you, number one, how token intensive these modes are, so be warned.
10:18And number two, it sort of kind of tells you how token constrained Tropic still is. In my view, this standard of presenting our rate limits as a percentage should actually be changed. Like, I would much rather them have, an absolute number of tokens in here so that we can really measure when they're saying that they're increasing rate limits or not so that it's much more transparent.
10:36Similar to how you're accessing the Internet and you exactly know how much data you're using as part of your mobile data plans, for example. But anyway, that's a topic for another video. But there you go.
10:45That is UltraCode and dynamic workflows. OPUS four dot eight is great. It's a great incremental release.
10:51But I think for most use cases, the way we work is probably going to be dictated more by the updates that they do to their harness. And UltraCode and dynamic workflows are just two updates that I think are worth paying attention to. If that's useful, then consider subscribing because that helps us a lot to put out more educational content like this.
11:07CTAAs always, thanks for sticking until the end, and I'll see you guys next time. Thank you.
— full transcript
§ 05 · For Joe

Two Claude Code modes that change how you scope a task

WHAT TO LEARN

Dynamic Workflows and UltraCode raise the ceiling on what a single session can accomplish, but they introduce real token costs that make effort level a genuine architectural decision rather than a preference.

  • Dynamic Workflows fans a session out into hundreds of parallel agents, turning multi-day research or audit work into a five-minute deliverable, but token cost scales proportionally.
  • UltraCode gives the model autonomy to decide whether a task warrants spawning sub-agents, so complex prompts get orchestration automatically without requiring an explicit workflow invocation.
  • The /workflows slash command shows a live status panel with agent count, phase completion, and per-agent token usage so you can monitor a long-running task without waiting blind.
  • A technically complete output that ships as an unstyled document will not be read; prompting a design pass as a second step is part of the workflow, not an afterthought.
  • Two heavy demos consumed 4% of a weekly Max plan limit, making effort-tier selection a real cost variable; reserve UltraCode for tasks where correctness across many files justifies the spend.
  • Anthropic presenting rate limits as a percentage of weekly budget makes it structurally impossible to verify whether rate-limit increases are meaningful; absolute token counts would give users real transparency.
§ 06 · Frame Gallery

Visual moments.