WEBVTT

00:00:00.080 --> 00:00:13.280
Called memory systems are a cheat code, but only if you use them properly. And in this video, I'm gonna show you exactly how to set up your own second AI brain that works across all apps with an incredibly

00:00:13.280 --> 00:00:55.455
simple setup. So you can have a memory operating system that makes you 10 times more productive and stops wasting your time. And if you don't know who I am, my name is Jack Roberts. I built and sold my last tech startup with a gazillion customers. Now I build my own AI businesses, and I just share the stuff that actually works. So if you haven't already, grab that beautiful coffee and let's dive straight in. So this is the Claude memory system, and I've spent over a year looking at memory systems. I've done a lot of research. This is the simplest one that I've actually found. So when we talk about the Claude code memory system, what do I actually mean? What does great look like? Let's define what that is before we think about what we're gonna build. Well, the first thing that a great memory system does is it remembers

00:00:55.455 --> 00:01:30.140
everything that you say. Right? Not just last time messages, not just the thread, every meaningful exchange that you've ever had with it, it remembers it and we can capture it and refer to it whenever we want to, just like this beautiful filing cabinet here. So the first goal is it needs to remember what we've said. The second thing is that we can change the the important stuff on the fly. Right? Maybe we don't wanna focus on strategy a anymore. We wanna focus on strategy b or maybe your income's gone up and you wanna change and reflect that in the strategy. So we need the ability to change the important stuff with your role, your stack, whatever it is, we wanna be able to disregard

00:01:30.220 --> 00:01:31.340
old information.

00:01:31.500 --> 00:01:46.525
The third thing that it needs to do, it needs to plug into every platform. This is what I call information silos. It's very common AI, chatty p t for this, Claude for that. Maybe your grandmother's basement for something completely different. We don't wanna be hip and hopping. We need a central

00:01:46.605 --> 00:02:27.830
memory system, which is the universal point of truth. We call it the memory core in this system. Fourthly and finally, it needs to fuel every answer with context. What is the point of this memory system if we can't ask it questions about stuff that's happened and we can't get context rich information when we ask it things about strategy? We have a prompt, we have our three tier three level memory system, then we get the output. That is the basic idea. Memory is not a vault, it's an import. Every prompt silently pulls from your stack. Who you are, what you're shipping, what you just started last month, so response lands sharper than any fresh chat could ever hope to do. Now how many times have you spoken with Claude or ChatGPT to get halfway through the conversation and it's talking complete

00:02:27.990 --> 00:02:28.870
Spanish?

00:02:28.870 --> 00:02:30.390
And it's because it has

00:02:30.630 --> 00:02:45.785
amnesia. Now the best way to solve for this is to think about your memory system as solving this across three levels. Okay? We have short term memory, which is very simply, who am I? We have midterm memory, which is what am I doing? And then we have long term memory, which is what's happened before

00:02:45.945 --> 00:02:51.020
and an expert knowledge base. I'll explain exactly what I mean. Alright. Now not to get metaphysical,

00:02:51.020 --> 00:03:16.065
but the first question you have to ask yourself is who am I? And I'm not talking about glass of red wine at 4AM in the morning. I mean, does your model actually know who you are? We call this the operating manual and it's the very first level, the very first tier of our three level memory system. So think about it like this. Who am I? It's your name, your role, your current goals, the way you like your answers framed, the tools you use, tone voice, non negotiables, stuff that does not change

00:03:16.305 --> 00:03:29.010
every week, but defines every reply. So here's me for example. My name is Jack Roberts, AI YouTuber, direct me fluff, m dashes emojis, vibe is Pacheco, if I'm talking about this particular presentation. Now the idea of this is that it lives natively

00:03:29.345 --> 00:04:04.855
in every single platform and it grows as you talk. So the idea with Claude, for example, has its own internal memory system which gets better over time. And when you express preferences or ask it things, it records that for you. So as you're actually talking to them and you say, hey, remember this or I prefer x so busy, Claude will actually remember that stuff for you. But one little hack I wanna show you real quickly. So first actionable step, go on Claude. If you're coding Claude, come to the bottom left under Jack Roberts, click on obviously, yours may not say Jack Roberts, unless we got the same name. I don't know. Go up to general. And what you're looking for here is instructions for Claude. Put some stuff in here. Try to make it no more than 200 words maximum.

00:04:04.935 --> 00:04:15.580
Just top level of of how you want Claude to behave and any key information about you. Okay? Then if you're in anti gravity, for example, all you do is click on the additional options and you click on customization.

00:04:15.740 --> 00:04:42.945
This will be the same in Versus Code and various other forks. And you'll see you have global and you have workspace. You click on global and these are gonna be Gemini. Mv which are the global instructions for how these models basically behave with you. Now what's important to bear in mind here is that there's two bits of memory. One is explicit hard code stuff, and then we have the generalized knowledge it has of you, which it will natively pick up based on your conversations. So this is an important thing to understand with these memory systems is that the outcome of the conversation

00:04:43.380 --> 00:04:45.060
should never ever

00:04:45.220 --> 00:04:47.300
depend on a chat history.

00:04:47.460 --> 00:05:06.505
I should be able with this system, and you can with this system, that's why it's so cool, open new chat window and regardless of any previous message, it should always give me the best advice possible. But think of this level one as your general top of funnel memory that identifies who you are. It's important to bear in mind that models forget things, they get truncated, they hallucinate.

00:05:06.505 --> 00:05:35.515
If it matters, we need to make sure we're writing it down. Okay. So real quick, what are you doing? Well, not right now. I mean, you and are hanging out together. But I mean, what are you working on right now? Because this is the second level of our three tier system. It's the projects you're doing. They may change from time to time. Maybe you're building a startup. Maybe you've got a really cool client or several cool clients. These are the things that you'll be doing right now and that forms the second level of our system. And that's why in this second level, it's actually quite structural. So what we're going to do essentially is create

00:05:35.675 --> 00:06:00.655
six or seven separate folders for all the things that we're doing. So for example, I might have one for my community. I might have one for my agency. I might have one for my startup. And or, you know, one might be personal life and health and fitness, getting like in shape and all that kind of stuff. So what you need to do first of all is head over to Claude, or this could be your model of choice. I need to ask you this question. Hey. Based on everything that you know about me, all about conversation, all about history and projects,

00:06:00.815 --> 00:06:04.415
I want you to organize all the areas of my life and business into

00:06:04.575 --> 00:06:15.930
six to eight different categories that surmise everything that I'm doing. I'm using this to build up folders to best organize my life and my business. And once you've got that, you can basically create it. And it is surprisingly

00:06:15.930 --> 00:06:37.370
accurate, but you don't want any more than eight. Otherwise, it's way too much to manage. Now what we're gonna do based on these eight things, we'll come down to Shiva's workshop, and what we're effectively doing is creating one unique Claude dot m d file for each of those eight projects. You could do it based on clients, but again, try to keep it under eight. Alright? So we've the Claude dot m d at the root that tells Claude what the project is.

00:06:37.610 --> 00:07:03.850
The memory folder beside it stores everything that evolves, the decisions, strategies, the summaries, the next actions. You can open up in any platform and it just works. And what I've done for you as well, I've pulled together this project operating manual that you can literally copy and paste. And what I want you to do based on those eight things is come down and essentially create this per project. So this is open opens a folder. So what happens here is that Claude will open this up before it touches any of your code. So it sits as a living

00:07:04.170 --> 00:07:26.840
medium term memory that can change based on what you're working on at the moment. It's the midterm layer of our three level memory system. Okay. So what's included in this folder? Well, we need to explain what the folder is, what the goal is, why does it exist. This exists to get me 10% body fat. This exists to make me x thousand dollars. This exists to help my client do x y z. The stack, what are you building it with? Decisions

00:07:26.840 --> 00:09:33.640
you've already made. Okay? So calls already made so we don't need to relitigate them. It needs to have a memory map about where each individual memory lives and any references that are relevant. Okay? Now the template is simply this. I'll let you basically copy and paste this, but essentially fill this out for everything. Okay? And here's an example of what that might look like based on everything you're doing. Now we wanna keep it under 200 lines because this is your MD. Effectively, what this will do is be prefilled into every single conversation, so we don't want it to be too big. It needs to be updated, needs to be accurate, and it's one folder per repo. So ask all the questions, copy this into these six folders, and then fill it out. Now if you are in Claw desktop app, what you can do in Cowork, if you're using Cowork, is literally use what I would call the projects tab. So you click on projects, this is their version of that. Okay? These can be your six or seven different versions. Awesome. Now, if you're in code, again, I'd recommend code honestly because it's just everything that code does, but like a better and unlimited and it's essentially the same thing. Then in code, all you would do is effectively create these six or seven folders on your desktop. And when you open this up, you can see all the folders on the left hand side. And it's the exact same thing in anti gravity. You can have all of the folders basically, and all you do ever do is you open it up and you open up the folder that you want to chat in based on the conversations that you're having. And so the idea here is that whenever you do work, all you do is you think about which of the seven things is it about. It's about my business. Cool. When I click on business, it's about growing my LinkedIn. Then I click on LinkedIn. And effectively, do all your work inside that project for that specific thing. It has these living midterm strategy that may change. It is what we call mutable. It is subject to change and we do all the work in those seven project folders based on the things that you want to do. And the third level is probably one of the most important which is it's a long term memory. And most people I've seen either drastically over complicate it or don't set it up properly, meaning they get none of the benefits but all the complexity. So level three is the arcade. It basically answers the question, what happened before? What the hell happened before this? So let's get into a little bit detail. Now to do this, we're gonna use one of two systems.

00:09:34.075 --> 00:09:44.395
Option one is gonna be something called Pinecone, which effectively is a big database. This looks really technical. You're probably thinking, Jack, this is way out of control. In reality, they're just five mystical

00:09:44.475 --> 00:09:50.760
bookshelves, which are very, very cool. In reality, what it looks like is you can ask questions to things like this on aiojack.com.

00:09:50.840 --> 00:09:58.120
All of my YouTube videos, any content I ever create is indexed into this beautiful thing in pine cone. It is unbelievably

00:09:58.120 --> 00:10:06.725
easy to set up. And the other alternative we have is, you might have guessed it, something called Obsidian. And Obsidian itself has gained a lot of virality,

00:10:06.725 --> 00:10:09.365
especially a new system, what we call Carpathi

00:10:09.365 --> 00:10:15.685
Obsidian, was considered an alternative to what we use in Pinecone, is retrieval augmented generation.

00:10:15.765 --> 00:10:40.215
Now the idea of this is that it tracks all of the text for any topic you want to. You can amend it. Claude has access to all of it in his markdown files, which is really cool. And it understands basically everything. And the more that you basically use this, the better the system gets because it finds cross linkages between everything. So this one here is a phrase that you can see. This is an example one I have on YouTube, and you can see the relationships

00:10:40.375 --> 00:12:11.840
between each of the individual things. I know you're thinking, Jack, graphs are great, but will graphs solve all my problems? The answer to that question is yes, they will. No. They won't. Of course, they won't. But, yeah, this is the idea of Obsidian, basically. And, obviously, you can do little funny things with it, which is, like, half the fun is moving these sliders up and down, basically. But this is what they call Carpathi Obsidian, which is his alternative to a complex rag system. Now this isn't a right or wrong. You can pick either one of these two systems that you like. Let me help you make an informed decision if I may. So if you look at Obsidian, which is the node one that I showed you, you use this when you want to read and edit the memory by hand. If you wanna physically look at your memories, double click in. For example, I wanna click on this. I wanna see what this says. I wanna come down and read it. Then you may wanna go for something like Obsidian. You can create strategy note, decision frameworks, visual graphs, and backlinks that are important to you and you want zero in for it, you just want the files. Obsidian might be one for you. I'll put a full link on screen right now if you wanna double click and set up Obsidian. I'll go through the entire thing. Alternatively, you have Pinecam. If you want semantic search across thousands of records, this is way more scalable, can work anywhere. You can publish it in apps. It can be accessed from anywhere on the planet. You know, you wanna store wrap ups on every session. If it's for scale, so you wanna store longer information like books and transcripts, it is incredible for that. In other if you want a readable file, use Obsidian. If you want index as searchable summary, use Pinecone. If you're asking what I personally use, I personally use Pinecone just because I don't wanna read through everything in Obsidian, but Obsidian does really have its genuine use cases. Now there are two levels to long term memory.

00:12:11.920 --> 00:12:36.555
One is an archive of any conversation that you've ever had. Now the really cool thing is whether this is in Claude, whether this is in OpenClear, Hermes, or any system you're using, we can, if we're using Pinecone, for example, send all this information to Pinecone and we can index it. So for example, if I have a big conversation with this, all I do is I create a skill. I can come down, do forward slash wrap up, and I have a wrap up skill that takes the entire conversation

00:12:37.060 --> 00:12:55.055
and embeds it in Pinecone. Now if you wanna know how to set this this wrap up skill up, I'll put a link on screen for you so can go through my full Pinecone video that breaks down the entire system for you, my Kapathi video. And then the second level of long term memory here guys is the what I would call knowledge, Deep knowledge. So for example, I might wanna have a YouTube expertise

00:12:55.135 --> 00:13:09.940
database in my pine cone, for example. And again, you could do this with Obsidian if you like to. But the idea here is this one might be YouTube. And this could be all the information that I want on for growing on YouTube or it could be my business or whatever it is. And then what I'm doing is when I'm having conversations,

00:13:10.260 --> 00:13:13.780
your AI agent or your AI conversation can call

00:13:13.860 --> 00:13:16.180
and reach over and conversate

00:13:16.180 --> 00:13:39.980
with this knowledge base when it's relevant. And so in the project folder section section that we covered in section two, what you might say and which is where it all interconnects together. Okay. So let's say for example that one of my folders is Glider. Right? Glider is the speech to text startup that I founded, lets you yap into a computer. In the Glider folder for example, in here, I might say, look, these are the indexes in pine cone I want you to consult

00:13:40.140 --> 00:14:07.380
when I ask you strategic questions. This is my long term memory. In addition to that, if I ask you for any history, consult this index. Then in Pinecone, I might have an index over here, which could be Glider strategy or it could be business strategy or Hormozi or whatever it is, and it will call that long term memory, that expert knowledge and history to help and improve the current conversation. Now to actually build out that long term knowledge, there's two systems we can do. One is to connect this to NotebookLM.

00:14:07.540 --> 00:14:29.550
I'll put a link down below. But for example, I could say something like, hey, go and do some research on business strategy according to Alex or Mercy. Go create for me a notebook in my notebook alarm and just get me like 50 different resources as much as you possibly can and create that notebook for me. Send that straight off. Now what's really cool about this is it will actually curate and you have the power of Claude or ChatGPT 5.5

00:14:29.550 --> 00:14:33.710
going ahead and building those notebooks for you, which is incredible.

00:14:33.870 --> 00:14:39.550
Then we can go over, grab that information, save it on our desktop, and even vectorize it into

00:14:39.710 --> 00:14:40.270
Pinecone.

00:14:40.555 --> 00:15:05.910
Or just like we did with Obsidian, have this beautiful thing over here, which I opened the graph for so you can see because it's very shiny and we like lots of graphs over here. Just use this. Now this is my, you know, Hormozi business strategy thing that I can ask questions to. Again, I'll put a link on screen for the Notebook LM deep dive if you wanna double click and learn more about that strategy. The other one that I absolutely love doing is Firecrawl. So for example, I could say something like this. Hey. I want you to use my Firecrawl integration

00:15:06.070 --> 00:15:14.755
and go ahead and do some deep research for me on Hormozi's best five practices for scaling a business past $10,000

00:15:14.835 --> 00:15:55.260
a month. Okay. Do that. Send that one off. Of course, we can connect Firecrawl to the connectors on plus. Click on connectors over here. You can see I've got Firecrawl installed. And all you literally do is come over, you grab the API key, you can come back over to the connectors, then what you would do is click on the plus, click on add custom connector, and then right here, you just type in Firecrawl, and then remote MCP server, just enter in this right here. Where I've got API key, you just enter your Firecrawl. And then basically, what that will then be able to do is use the FireCall agent, and this can save you like 80% of the cost, and it's just way more efficient in getting accurate data for agents. So what this basically means is we have an archive of every conversation. So every meaningful conversation ends with a wrap up. That summary decisions, next actions, made the metadata,

00:15:55.340 --> 00:16:02.220
the summary lands in the archive. So if I ever say to, hey, what was that thing we discussed about this really important event last January?

00:16:02.380 --> 00:16:20.395
We've got it. And when we do index it, we have the date and we can set all those different filters in Pinecone or locally on a computer if we're using Obsidian, which is really cool. That's the exact process we went through. Then we covered the expert knowledge that we've embedded. We've used Firecrawl to go get some deep research. We can use NotebookLM to build beautiful, detailed,

00:16:21.035 --> 00:16:51.745
impressive notebooks on any topic we want to using the world's strongest AI research bottle, which is freaking incredible. And we can pull that down anytime we want to in our memory system. So we have the three layers. We have short term, midterm, and long term. Level one is who we are. Level two is what we're doing, and level three is everything that's happened before and that perfect knowledge. Now your memory system is only strong as the skills that support us. So what we need to do next is learn what I call super skills that can make your memory system even more powerful, and we're gonna learn that by watching this video right here.
