WEBVTT

00:00:00.160 --> 00:00:02.880
Anthropic just released Claude Opus 4.8.

00:00:02.880 --> 00:00:14.000
I dissected the entire article breaking down the new features as well as I've been testing it out myself. So I'm gonna make a no BS breakdown on exactly what's different. If you're nontechnical,

00:00:14.000 --> 00:00:18.295
I'm not gonna talk benchmarks and complicate this. I'm just gonna talk about tangibly

00:00:18.295 --> 00:00:24.375
what is different, what you can expect from this, and if this is a real game changer that I could almost guarantee

00:00:24.455 --> 00:01:14.880
many YouTubers out there are gonna say, I'm not gonna sit here and hype this up. So without further ado, let's dive into a real breakdown on this. First of all, I read this entire article right here by Anthropic, and I turned it into bite sized breakdowns on the main things and the four main takeaways from this. If you guys want to see the full article, there will be a link in the description so you could just dissect it yourself, but I wanted to condense this for you here. So first things first, here is a breakdown on the benchmarks. We're not gonna talk about this. Basically, all we need to know is, yes, this is better than Claude Opus 4.7 as well as GPT 5.5 and Gemini 3.1 Pro on basically everything here. The only thing that any of these other models are better at than OPUS 4.8 is GPT 5.5 for AgenTic terminal coding, but if you're not using Claude code, this doesn't necessarily pertain to you. Alright. So first of all, let's talk about availability.

00:01:15.325 --> 00:01:31.140
As of right now, Cloud Opus 4.8 is available everywhere today, so we could use this on the desktop app inside of Cloud Code, Cloud Cowork, all these different places we could use it. And then let's talk about pricing as well. So pricing for regular usage is unchanged from OPUS 4.7,

00:01:31.140 --> 00:01:35.700
which is great. That is $5 per million input tokens and $25

00:01:35.700 --> 00:01:44.705
per million output tokens. So in terms of pricing, we don't need to worry about anything upgraded from the OPUS 4.7 model. It's gonna be exactly the same,

00:01:45.025 --> 00:02:09.940
as well as developers can now use Claude OPUS 4.8 via the Claude API. So if you're using Claude code or you're using it as a developer, that's available right now as well. Next up, this is one of the bigger things inside of this release. There is a new effort mode that's inside of the Claude dot a I and the Claude desktop app, meaning that if you've ever used Claude code, you can select which effort level Claude Opus 4.7

00:02:09.940 --> 00:02:22.155
was gonna perform tasks for you. So if you're doing something a bit higher, you could put it on max or high mode, and you can now do this directly inside of the normal Claude app. It does it does say that OPUS 4.8 defaults to high effort,

00:02:22.235 --> 00:02:25.995
which we judge to be the best overall balance of quality and user experience.

00:02:26.500 --> 00:02:36.660
So, again, it's gonna automatically default to this. If you have specific tasks that are a bit easier for it to do, you could always toggle it down to a lower mode so you're not spending as much on your tokens.

00:02:36.820 --> 00:02:46.815
It also looks like if you are using Claude code, users can now choose extra high mode inside of Claude Code, or you could use the normal max mode that, you know, you already could use.

00:02:47.135 --> 00:02:57.695
Then it also looks like there is increased rate limits inside of Claude Code to accommodate the higher token usage of higher effort levels. Um, and, again, users can just select whatever they want for a particular project.

00:02:58.310 --> 00:03:06.070
In order to show you this live in action, here is Claude Cowork. As you could see, I have Opus 4.7 high selected here,

00:03:06.230 --> 00:03:09.430
and now I can go ahead and click on Opus 4.8.

00:03:09.510 --> 00:03:14.535
It automatically defaults to the high mode like I just mentioned before. And if we click,

00:03:14.695 --> 00:03:44.905
uh, on Opus 4.8, we can now see this little effort level, and we can just go and preselect what we want. So we could turn it on low, medium, high, extra, or max, as well as we can, you know, toggle on adaptive thinking. This was already there, but all of these new effort levels that we could see right here is new. So after dissecting this article, one of the main things that I've seen and one of the things that Anthropocos is saying, like, is the biggest part of this release is that one of the most prominent improvements inside of OPUS 4.8

00:03:44.905 --> 00:03:46.505
is its honesty,

00:03:46.505 --> 00:03:52.025
which to me, honestly, sounds like a no brainer. You want AI to give you honest responses.

00:03:52.185 --> 00:04:06.500
And for me, personally, this is something I ran into with Opus 4.7 and any AI model for that matter. But a lot of the times, the information that it would give me just wouldn't be correct, and I'd always have to say, hey. Can you make sure that this is honest and this is the correct information?

00:04:06.740 --> 00:04:16.855
So, apparently, this is gonna be a bit more honest of a model, which I would hope so. It said that we train all of our models to be honest, for instance, to avoid making claims that they can't support,

00:04:17.015 --> 00:04:54.110
which, again, I've had this countless amount of times using other OPUS models before and any Claude model. They did mention that early testers report that OPUS 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. Obviously, we're gonna have to see this in practice, if it actually is more honest, them saying this on paper really doesn't mean much until we test it out. There are a couple of other things that are being released alongside this, so I'm gonna talk about that here in a second. But first of all, let's talk about what Anthropic is saying is what's coming next. So in their own words, they're saying users will find Opus 4.8 to be a modest but tangible improvements on its predecessor.

00:04:54.270 --> 00:04:58.590
So when they are coming out and saying this is just a modest update,

00:04:58.670 --> 00:05:02.115
you know, I'm not expecting this to be a huge leap in capabilities.

00:05:02.115 --> 00:06:37.835
We're probably gonna notice a little upgrade, probably just a marginal one. It's not gonna be anything game changing like I said. Hopefully, it is more honest. You know, as they said, this is, like, a big part of their release. They wanna make this more honest. And they did come out and say that there is still a lot more work to be done. We're working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost, and that is what's coming next. And if you've never heard of Claude Mythos, this is the AI model that apparently is too good for them to release to the public, so they've actually given this to some of the Fortune 500 companies to run internal testing to make sure this is actually safe to give to the public. They came out and said that not only that, but we plan to release a new class of models with even higher intelligence than Opus, and this is the new Claude Mythos preview model. Right here inside of the article, it says that we can expect this inside of Claude in the next coming weeks. It didn't really give an exact date, but just know that this is something that could be coming out in the next two, three, four weeks. Alright. So next up, we have dynamic workflows. This is a new feature available in research preview, allows Claude to take on even bigger tasks inside of Claude code. So if you don't use Claude code so let's say you're just an everyday Claude chat and Claude CoWork user, this won't necessarily pertain to you, but let me break it down just a little bit here. They are claiming that Claude can now plan the work and run hundreds of parallel sub agents in a single session, and it says that with OPUS 4.8, the agents can run for even longer. So, again, that only matters if you are using Claude code. Next up, one of the other changes that we did mention is the effort control inside of claude.ai

00:06:37.835 --> 00:06:47.490
and Claude Cowork, which is something I'm personally happy about. And then this is another small one, but the messages API now accepts system entries inside the messages array,

00:06:47.570 --> 00:06:55.250
and this only matters if you are a developer. Alright. So my main takeaways after reading this article is that this is the same price as OPUS 4.7.

00:06:55.250 --> 00:07:21.760
It's automatically gonna use the new default high effort mode, and so that is great that this isn't a more expensive model. Next, a big part of this release is focused on honest responses, which is a no brainer. Obviously, they should be focusing on that. Then we have the new effort levels. Again, I'm actually pretty excited for this. It's a very small feature, but I think this will be, you know, pretty practical for a lot of people. And then lastly, my main takeaway from this is that these are all just small marginal improvements.

00:07:22.085 --> 00:07:28.085
This is definitely not a game changing release. Of course, it is a new level up from Claude Opus 4.7,

00:07:28.085 --> 00:07:59.975
but I think this is kind of laying the groundwork for a bigger model release in the Claude Mythos preview that should be coming out over the coming weeks. So if you guys do wanna test it out, it's available right now in the Claude desktop app as you can see right here. We could see it inside of Cowork. We could see it inside of normal Claude chat mode, as well as this is available inside of auto code as well. And there we have it. That's a quick breakdown. If you guys got some value from this video, leave a like, subscribe to this channel for more AI content for nontechnical people. I wanna make this video short and to the point, so hope you have a good day, I'll see you in the next video.