WEBVTT

00:00:00.160 --> 00:00:08.560
Opus 4.8 is out, and it's an absolute smash home run. They implemented so many changes, and a lot of these were hidden. They weren't even in the announcement.

00:00:08.720 --> 00:00:26.185
In this video, we're gonna go over every single one of those changes. I'll tell you exactly what you need to do to take advantage of all these changes, and we'll even run some fun tests with it. If you stick with me until the end, you are going to be a master of the most powerful technology that's out there right now, Claude Opus 4.8.

00:00:26.185 --> 00:00:47.925
Let's go. Here we go. We are not a channel where we sit here for twenty minutes reading blog posts. Let's quickly go through all the changes, then we'll get straight into the product. First thing you need to know: it smashes all the benchmarks. It beats ChatGPT 5.5 and all the other models in all the benchmarks. Google's not even close. No one else is even close. This destroyed all the benchmarks. I actually believe

00:00:48.325 --> 00:01:09.820
this is kind of a watered-down version of Mythos. I'll go into that in a little bit, but I actually believe this is Mythos, just a little bit weaker. It's the same cost. This is mind-blowing, and I think this is a result of all the compute Elon Musk just gave to Claude, but it is the same cost as Opus 4.7, which makes this the first release

00:01:10.065 --> 00:01:41.845
in quite a bit of time where the price didn't go up. For both OpenAI and Anthropic, with all their new releases, the price ticked up slowly and slowly and slowly. This is the first one in a while where the cost did not go up. Again, just a few weeks ago, Elon sold Anthropic tens of billions of dollars of compute, and I think this is a direct result of that deal, which is really amazing. Here's a big one, and also a big reason I think Elon's helped out Anthropic a lot: their fast mode is cheaper. A big reason

00:01:42.165 --> 00:01:57.210
I have been using ChatGPT 5.5 inside Codex more the last couple weeks is their fast mode is dirt cheap. You get way better performance for not that much more money. Claude's fast mode was six times more expensive,

00:01:57.450 --> 00:02:17.015
which is untenable. Their limits were already low, so their fast mode wasn't worth it. Now their fast mode is three times cheaper than it was before, which, if we do the advanced algebra here, comes out to just two times more expensive than the regular mode, if my math is correct there. So their /fast mode is actually affordable

00:02:17.015 --> 00:02:20.535
if you're on the $200 plan, and we'll go into exact recommendations

00:02:20.535 --> 00:02:34.040
right after this, so stick around. It beats ChatGPT 5.5. I thought 5.5 was the first model ever that beat Opus at coding, but this one came right back and beat it. Four times fewer hallucinations.

00:02:34.335 --> 00:02:44.255
So this is a big one, and it's a big reason why I think this is actually just Mythos, watered down a little bit: one of the things they showed off with Mythos was the reduction

00:02:44.415 --> 00:02:45.455
in hallucinations,

00:02:45.455 --> 00:02:57.170
about a four-times reduction. It's matching Mythos in a lot of the things they advertised it for. And for those who are new to the channel, kinda new to the AI world, Mythos is this model that Claude has been advertising now for a couple months.

00:02:57.410 --> 00:03:21.140
They've been advertising it as some sort of doomsday model that's outrageously good, that can hack any website. They've been teasing us a bit. It appears we're getting closer and closer. One big thing to notice in their announcement blog post, and this wasn't in the tweet either: they expect to bring Mythos-class models to all customers in the coming weeks, which means they're doing it. They're actually gonna release Mythos. Again,

00:03:21.380 --> 00:03:44.570
I think Elon Musk saved Anthropic from the dead here. They were starting to lose in every single facet because of their lack of compute, and now they're gonna release Mythos. I don't think it's any coincidence that they're increasing limits, making prices cheaper, and releasing super powerful models within weeks of buying tens of billions of dollars of compute from Elon Musk. Now from a new functionality

00:03:44.570 --> 00:03:45.370
perspective,

00:03:45.370 --> 00:03:51.290
here's the two big things, and we're gonna demo this when we get into the product. Dynamic workflows

00:03:51.370 --> 00:03:52.570
and ultracode.

00:03:52.570 --> 00:03:55.210
What are these two things? Why are they so big and important?

00:03:55.705 --> 00:03:56.505
Dynamic

00:03:56.505 --> 00:03:57.385
workflows

00:03:57.385 --> 00:03:58.345
is now

00:03:58.505 --> 00:03:59.465
Claude Code's

00:03:59.465 --> 00:04:00.425
ability

00:04:00.425 --> 00:04:15.710
to tackle months of work in just a day. What does that mean? If you give Opus 4.8 a very complex task, a big, juicy, meaty, girthy complex task, it will now spin up anywhere from tens

00:04:15.870 --> 00:04:18.510
to thousands of subagents

00:04:18.670 --> 00:04:26.945
to tackle that task. Say you were trying to implement a really big new feature or one-shot a big app.

00:04:27.185 --> 00:04:38.225
Before, if you gave it to Opus, it would just have one agent go there and add some code, take some code away, add some code, do some research, add some code. Now it's going to take literally thousands of those agents,

00:04:38.730 --> 00:04:46.170
send them out. They're all gonna be touching different pieces of your code base, doing research, testing things out. And simultaneously,

00:04:46.170 --> 00:04:52.810
these tens of thousands of agents are gonna be writing code, testing, using the app, doing regression tests,

00:04:53.245 --> 00:05:03.245
a whole bunch of things. It's going to be really, really powerful, and it is now in Opus 4.8. Again, this is going to allow you to do months of work in just one afternoon.

00:05:03.565 --> 00:05:48.935
And then you have ultracode mode, which is basically giving the keys to the kingdom to Claude Code, to Opus 4.8, and saying, hey, use dynamic workflows whenever you want. You're only using this if you've got that $200-a-month plan. So this is Opus 4.8. I'm gonna go into my exact recommendations on how to use it in a second, then we'll go into the product and demo it out. But, again, absolutely massive changes here. Let's go into the recommendations. Number one, switch all tasks to Opus 4.8. There's no reason not to go into Claude Code right now, pull it open, and choose Opus 4.8. Now, you can choose the million context if you want, but you don't absolutely have to use it. I find once you start to fill up that million-context window,

00:05:49.095 --> 00:06:15.120
the performance actually degrades a good amount. So I'm actually an Opus 4.8, regular-context type of guy. From an effort perspective, I recommend high by default, and then, when you are building out much bigger things, switching to extra or max. But I would, by default, stay in high, and then only switch to extra and max when you have to. Despite the fact that Papa Elon allowed Anthropic to have much more compute and capacity,

00:06:15.705 --> 00:06:17.705
it's still not as high capacity

00:06:17.785 --> 00:06:19.145
as ChatGPT.

00:06:19.225 --> 00:06:25.785
So I'm sticking with high by default, then going to extra and max if necessary. When it comes to Hermes and OpenClaw,

00:06:25.945 --> 00:06:39.510
I wouldn't move them to Opus 4.8 just yet. This is a big mistake a lot of people make: they try to force their agents into using the latest version of Opus the moment it comes out. The issue is this invariably leads to errors, leads to crashes,

00:06:39.670 --> 00:06:46.070
and errors and crashes in OpenClaw and Hermes are not the most fun to solve. I would wait until the official releases,

00:06:46.635 --> 00:06:57.115
which typically come within twenty-four hours of the release of the model. So once it officially releases, then you switch it over, and you'll have way fewer crashes and way better reliability.

00:06:57.115 --> 00:07:06.560
As for the /fast mode and the new ultracode mode, I'd only use those if you're on the $200-a-month plan. And even if you're on the $200-a-month plan,

00:07:06.800 --> 00:07:11.200
I don't know if I'd use them for every single prompt. Like, with ChatGPT Codex,

00:07:11.200 --> 00:07:17.280
I'm using fast mode for literally everything because they give you so much capacity. Claude's limits still aren't as high as ChatGPT's.

00:07:17.825 --> 00:07:27.665
So for me, I actually have extra usage on, which means once I get past limits, I just pay through the API. So I'm actually gonna be using fast and ultracode for almost everything.

00:07:27.905 --> 00:07:51.165
But for you, if you don't have the extra capacity, if you're not on the $200-a-month plan, then I wouldn't use these modes. Totally up to you. And here's the last recommendation before we get into the product and do some cool demos. You need to lock the hell in. You need to lock the hell in. I've been working with a lot of people lately, watching how they vibe code, seeing what they do. One issue I'm seeing is AI is enabling a lot of people to get wildly distracted.

00:07:51.325 --> 00:07:59.565
They will send a prompt to their AI, and then they will go and doom scroll for an hour despite the fact that their AI finished the task, like, fifty minutes earlier.

00:08:00.060 --> 00:08:15.435
You cannot get distracted. If you can get into a flow state and lock the f in, you are going to get so much more done. I truly believe the number one indicator of how successful someone will be in 2026

00:08:15.435 --> 00:08:17.355
is their level of focus.

00:08:17.515 --> 00:08:29.940
Do not allow this extra power to mean you can slack off more. Use this extra power to get more done. So really work on your focus. Put the phone away. Close social media. Close Twitter. Close YouTube.

00:08:30.100 --> 00:08:37.540
And just lock in, and you'll get so much more out of this tech. Now let's jump into the product and build some cool things out. I'm using Claude Code desktop.

00:08:37.780 --> 00:09:05.140
You can use the CLI or the extension to take advantage of Opus 4.8 right now. I'm going to run one of the world-famous Alex Finn benchmarks on this model. This is a benchmark I've run on every single model. Up until now, Opus 4.7's actually been king, with by far the best scores in all four of these tests. We're gonna run the 3D first-person shooter test here, see how it does, see how it compares to the other models. If you wanna run this benchmark yourself, I'll put the

00:09:05.780 --> 00:09:29.460
prompt for this down below so you can run your own world-famous Alex Finn benchmark. I'm gonna hit enter on this, and I'm gonna send it off, and we're gonna see how it does. Basically, what we're gonna have it do is we're giving it creative freedom. We're saying build a 3D first-person shooter using Three.js. Do whatever you want. Make it as creative as humanly possible. Add power-ups. Do whatever you want. We'll see how good Opus does here. Side note, remote control is active.

00:09:29.620 --> 00:10:13.195
There's actually a setting in Claude Code not many people know about. You should be using this setting. It turns remote control on by default for every single chat. What this allows you to do is, whenever you spin up a new chat in Claude, you can actually go on your phone, go into the code section in the top left, and, as you can see here, that chat I just started is now on the screen: "Create the stylistic 3D first-person shooter." So I can now go mobile whenever I want with every single chat I start. So make sure to turn that on. A little tip for you there. A little bonus tip. Go into the settings and turn on remote control. The only thing I ask for that tip is you tip me with a like down below. Subscribe if you learned anything so far. Turn on notifications.

00:10:13.435 --> 00:10:37.945
And I'm going to do a full boot camp on Opus 4.8 tomorrow in the Vibe Coding Academy. Make sure to join that number one AI community on planet Earth. Link down below. Best decision you'll ever make in your entire life. Alright. Looks like it's done. It even tested itself, which is sick. Let's see how this is. Neon Assault. It's always neon themed. I have no idea why. First model that makes a non-neon-themed game, I'm gonna give it a 10 out of 10. Here we go. Let's engage.

00:10:38.345 --> 00:10:48.345
This is nice. This is nice. These graphics are very, very nice. I mean, if this is your first time watching my channel, you might think, what the hell is this guy talking about? This sucks. This isn't Cyberpunk 2077.

00:10:48.750 --> 00:10:53.070
But if you compare this to the default apps that previous models have built,

00:10:53.390 --> 00:10:56.510
this is pretty nice, from the walls to the ground.

00:10:56.830 --> 00:10:59.630
These are the enemies. Oh, to the way the gun shoots,

00:10:59.710 --> 00:11:04.855
to the way you can see hit markers on the enemies. I assume these are... even the power-ups look nicer.

00:11:09.015 --> 00:11:11.895
Wave two. So they got combos. They got waves.

00:11:12.135 --> 00:11:15.735
This is for sure an upgrade and probably

00:11:15.735 --> 00:11:17.655
the best version of this we've seen yet.

00:11:18.540 --> 00:11:20.220
Oh, this is an enemy. Okay.

00:11:21.900 --> 00:11:28.460
This is probably a step above what 4.7 gave me. Probably just a small step. So I'm gonna give it a 9.1.

00:11:28.460 --> 00:11:42.145
I'm gonna run the next three benchmarks, probably on a livestream next week. If you wanna see that, make sure to turn on notifications down below. Again, here's a reminder of my recommendations. You wanna be jumping on this now. When they release

00:11:42.305 --> 00:11:43.665
new technology,

00:11:43.905 --> 00:12:07.425
you have a distinct advantage if you start using it right away. Your competition probably isn't using Opus 4.8. They're probably not using the dynamic mode that sends out tens of thousands of subagents. They're probably not using that. So if you go and you use this tech and you build out really, really cool things, you are going to have a distinct advantage over the rest of the field. So you wanna make sure today,

00:12:07.825 --> 00:12:11.585
carve off some time in your calendar, go on do not disturb mode,

00:12:11.905 --> 00:12:41.915
close out all the doom scrolls you got, the tickety tocks, the Twitters, all of that, and lock in and use this and build cool things, because you have an advantage right now over everyone else if you take advantage of all these different features and functionality they just released. Let me know what you want next about Claude. Do you want tutorials on how to build really complex apps? Do you want deep dives into functionality? Do you want more benchmarking to see if it's the best? Let me know down in the comments. I'm super curious what you want. All my videos are based on your feedback. I hope this is helpful. See you in the next