WEBVTT

00:00:00.000 --> 00:00:02.720
One of the biggest limitations of Anthropic's

00:00:02.720 --> 00:00:15.360
Claude models today is the lack of true persistent memory across sessions, and this is due to sessions being stateless by design as well as them having a small context window in comparison to other models like Gemini.

00:00:15.775 --> 00:00:23.855
When a chat or coding session ends, the model usually starts fresh the next time. This is where Claude forgets your project context,

00:00:23.935 --> 00:00:24.735
decisions,

00:00:24.735 --> 00:00:37.970
and past work. That means you're forced to actually re explain everything again and again, which not only wastes time, but also burns through your tokens on repeating context instead of actual useful generations.

00:00:38.370 --> 00:00:39.890
This constant repriming

00:00:39.890 --> 00:00:42.130
eats up into your token budget,

00:00:42.290 --> 00:00:47.105
leaving fewer tokens for real reasoning, full use, and high quality output.

00:00:47.265 --> 00:00:55.185
In practice, that can even limit how effectively Claude can use tools or produce deeper, more thoughtful results

00:00:55.265 --> 00:01:03.530
because so much of the budget is actually spent just getting the model back up to speed. But there's a solution to this, and it's called CloudMem.

00:01:03.690 --> 00:01:10.730
CloudMem turns CloudCode into a tool that actually remembers your project history across sessions,

00:01:10.890 --> 00:01:13.850
so you don't have to re explain context every single time.

00:01:14.425 --> 00:01:17.545
Instead of forgetting everything when a session ends,

00:01:17.705 --> 00:01:20.505
Claude memory or Claude mem automatically

00:01:20.505 --> 00:01:24.345
captures what Claude does with tool usage, decisions,

00:01:24.345 --> 00:01:25.545
and observations,

00:01:25.625 --> 00:01:27.625
then compresses that information

00:01:27.705 --> 00:01:31.490
and stores it into a local database with vector search.

00:01:31.650 --> 00:01:37.810
It can inject relevant context back into future sessions so Claude truly remembers your project.

00:01:38.050 --> 00:01:53.035
On top of that, it lets you search your past work using natural language through mem search and MCP tools. And the best part is is that it's open source and runs automatically in the background once you install it. Now to test the impact of persistent memory,

00:01:53.115 --> 00:02:01.200
I used the same detailed dashboard prompt twice, once with claud in a stateless session and once with ClaudeMem enabled.

00:02:01.280 --> 00:02:05.360
And there's definitely a drastic impact in terms of the differentiation.

00:02:05.760 --> 00:02:07.360
Without ClaudeMem,

00:02:07.360 --> 00:02:09.920
Claude was able to produce this functional dashboard,

00:02:10.000 --> 00:02:15.455
but it missed a lot of project specific details. There's a couple of errors already showcasing

00:02:15.615 --> 00:02:23.455
when I am scrolling through this dashboard, repeated generic patterns, and required more iterations to actually align with the editorial.

00:02:23.455 --> 00:02:28.240
The high end product design that was described in this prompt wasn't actually outputted

00:02:28.240 --> 00:02:29.440
with this generation.

00:02:29.520 --> 00:02:32.960
Whereas if I'm to compare it with the CloudMem generation,

00:02:33.120 --> 00:02:36.320
it remembered previous decisions, the tool usage,

00:02:36.480 --> 00:02:42.245
as well as the context from prior sessions, which resulted in a cleaner, more precise dashboard

00:02:42.245 --> 00:02:44.325
that followed the design constraints.

00:02:44.485 --> 00:02:49.605
And you can see that there's subtle interactions with each component, the signature feature

00:02:49.605 --> 00:03:12.975
exactly that was intended from the prompt that was actually sent in. Now I'm not saying that it is perfect because you can see that this generation over here with the chart doesn't look as great. But in terms of adding all of the required features that I'd asked for was added with this generation with Claudemep. The difference clearly shows how persistent memory directly improves output quality, which reduces redundancy

00:03:13.055 --> 00:03:19.375
as well as allows the model to focus its token budget on creating thoughtful production ready UI

00:03:19.455 --> 00:03:21.455
rather than just reconstructing context.

00:03:21.930 --> 00:03:46.445
Shipping features is easy. Shipping the right feature is what actually moves the needle. Most teams either move too slow because they lack the data or move too fast because they're building things users never touch. But today's video sponsor, Posthawk, gives you the full feedback loop. Understand what users do, watch how they interact, and validate changes before rolling them out. That means product analysis,

00:03:46.445 --> 00:03:47.965
session replay, surveys,

00:03:48.320 --> 00:03:49.520
feature flags,

00:03:49.600 --> 00:04:02.720
and a b testing all in one place. You don't need a data team to use it. Setup takes minutes. You just simply install an SDK or paste a snippet and post hoc auto captures page views, clicks, and sessions.

00:04:03.345 --> 00:04:13.425
There's also a generous free tier. Most teams never pay, so you can get real value before committing to anything. So if you're building a product and want to ship with confidence,

00:04:13.585 --> 00:04:16.625
Posthog is worth checking out with the links in the description below.

00:04:17.210 --> 00:04:24.570
Now there's a couple of prerequisites that you're gonna need to have beforehand. You're obviously gonna need to make sure you have Cloud Code installed beforehand.

00:04:24.730 --> 00:05:01.830
If do not have it, you can easily install it for whatever operating system you have, then just make sure you have Node dot g s 18 or above, bun installed u v, and then SQL lite three for persistent storage. And once you have all these requirements fulfilled, we can then get started with the installation. It's super simple. All you gotta do is go into your terminal and just start up cloud code using the Cloud command. Once you have it running, use the slash plugin command, and then what you wanna do is head over to the marketplace by using the arrow key on your keyboard, and you're gonna go ahead and add a new marketplace, and that is by clicking enter.

00:05:02.195 --> 00:05:06.115
This is where you're gonna need to go back into the repo or this doc.

00:05:06.275 --> 00:05:09.875
Essentially, what you wanna do is just copy this last section

00:05:10.035 --> 00:05:20.070
of the creator as well as the repository name. And then what you wanna do is just simply go ahead and paste this in into the add marketplace section and click enter.

00:05:20.230 --> 00:05:23.830
This will clone the repo. Then you'll see within the marketplace section

00:05:23.910 --> 00:05:26.150
the new CloudMem marketplace,

00:05:26.230 --> 00:05:45.720
and you wanna click on enter. You wanna browse the plugin, and you wanna go ahead and install CloudMem if you haven't already. After installing, it's recommended that you close Cloud Code, and then you can restart it back up, and you should have CloudMem now working within your Cloud sessions where you're gonna be able to have persistent memory

00:05:45.720 --> 00:05:47.560
across all of your sessions.

00:05:47.800 --> 00:05:59.165
Now this is gonna be a huge feature that will help you in so many ways. You can see right now that bunnett is actually running, which is also gonna let you view this with the web viewer UI,

00:05:59.405 --> 00:06:04.445
which is where it's gonna prompt open this UI for you to actually manage your session

00:06:04.605 --> 00:06:47.650
as well as have it so that you can interact with your memory. Now there's a lot of different commands that you can use with CloudMem, like injecting your own memory directly within all of your sessions, which is something that you need to be really careful about. Because if you inject the incorrect memory, it could be interfering with future generations and sessions, which is why it's recommended that if you're working with the production build, might wanna consider that you should turn off CloudMem in certain cases. But now that CloudMem is enabled, you have a full cycle where you can now use this memory feature, that persistent memory where it's gonna be able to read all of your files, write them, it can edit, use bash, glob, grep,

00:06:47.730 --> 00:06:49.330
all other cloud tools,

00:06:49.490 --> 00:06:53.330
and it is gonna capture all of that with this persistent memory locally.

00:06:53.645 --> 00:07:02.445
And you're gonna be able to even retrieve context with the slash, uh, commands to even have it so that you can pull and even inject new memory.

00:07:02.605 --> 00:07:11.830
And right now, you can see that there's two new, uh, commands that you'll see. The cloud mem do, which is where you can execute a plan using sub agents for implementation,

00:07:12.150 --> 00:07:18.150
which is a pretty cool new feature, and then creating an implementation plan with the documentation discovery.

00:07:18.230 --> 00:07:24.765
So two of these features can go hand to hand. But you can see right here that I had requested to create a landing page,

00:07:25.005 --> 00:07:28.845
and it is gonna use the plan tool with the memo feature.

00:07:28.925 --> 00:07:41.330
And you can see that it is using multiple tools to help me execute this task right away. Also, an FYI, if you use the MCP tools, you're gonna be able to get better memory search with your project history,

00:07:41.410 --> 00:08:09.130
and you can easily enable this with the link in the description below, which showcases how you can set this up. But you can see that it has created multiple phases as to how it is actually gonna create this, so let's go ahead and have it work upon creating this landing page for us. And here we go. Now I wanna give you guys some context as to why I created a landing page. What does that actually do with persistent memory? Well, the thing is is that I actually went ahead and injected

00:08:09.210 --> 00:08:21.115
a lot of my previous catalogs of landing pages, And I had it reference all of those landing pages because I was gonna have it remember my context that I had with previous generations.

00:08:21.275 --> 00:08:33.310
So I didn't have to re explain what sort of output I was looking for because clearly if you are to request any AI model to generate a landing page, it is gonna generate that typical purple

00:08:33.390 --> 00:08:39.150
AI SaaS landing page that you don't wanna see. In this case, it saves up 95%

00:08:39.150 --> 00:08:46.095
of the tokens each time that you start a session. There's far more tool calls to get the best generations outputted.

00:08:46.095 --> 00:09:05.560
And with memory preserved, you can have it so that Claude can make 20 times more tool calls with ClaudeMem enabled, which is why I get these beautiful generations out of it. And I kid you not, this is another landing page that I had generated with the same prompt, and in this case, it was able to use the catalog of typography

00:09:05.640 --> 00:09:08.200
UI elements that I had saved through other

00:09:08.825 --> 00:09:33.850
generations that I had injected into claudamem, and you can see that it did do a great job with this interactive landing page that was created in a single shot. If you like this video and would love to support the channel, you can consider donating to my channel through the super thanks option below. Or you can consider joining our private discord where you can access multiple subscriptions to different AI tools for free on a monthly basis,

00:09:34.250 --> 00:09:56.940
plus daily AI news and exclusive content, plus a lot more. But that's basically guys for today's video on CloudMem as to how you can have persistent memory across all of your sessions within Cloud Code. This is a remarkable project and there's a lot more to this. You're only gonna get to understand the true capability of it as you use it more and more as you build up your persistent memory

00:09:57.020 --> 00:10:22.175
and have it used across all of your sessions. I'll leave all these links in the description below so that you can easily get started. But if you haven't already, make sure you go ahead and subscribe to our second channel. Join the newsletter. Join the Discord. Follow me on Twitter. And lastly, make sure you guys subscribe, turn on notification bell, like this video, and please take a look at our previous videos so that you can stay up to date with the latest AI news. With that thought, guys, have an amazing day. Spare positivity, and I'll see you guys fairly shortly.
