WEBVTT

00:00:00.000 --> 00:00:26.315
This recent Anray Karpathy post was a bit of a light bulb moment for me. It talks about the strong link between LLMs and knowledge base or in simple words AI and your second brain. It's a simple three layered cake, taking notes, have a place to read them, and then q and a on an ongoing basis. Simple, but it gets complicated if you dive in. Agents don't really have a good place to put pieces of information that we will actually use later. Chat history is not only memory,

00:00:26.555 --> 00:00:49.615
it has some lessons learned. Right? Some for the agent, sure, but some for you, the slave. I mean, the user. Many people just ask it to create a summary MD five, but that's not a knowledge system. If you tell them to store information they gathered as MD files in a central place, just like you're asking them to store code in Git repos, you now have your own handwritten notes along with knowledge you've read as ZLM responses,

00:00:49.695 --> 00:00:51.295
but never actually captured.

00:00:51.455 --> 00:01:03.430
It can now be stored in your second brain. Now you can review this information with Obsidian. By doing that, you're kind of treating it as your IDE. It's used to review and edit LLM text. See the connection?

00:01:03.750 --> 00:01:10.435
Now you can start asking your second brain questions, and that is exactly why Obsidian's new CLI is so interesting.

00:01:10.595 --> 00:01:34.890
And on top of that, I added another open source to bring this concept to life but on god mode because the goal here isn't LinkedIn fluff. It's actually getting around my own personal second brain and using it. So this video is not just Obsidian runs in the terminal now. That's cool. Not the point. The point is Obsidian might be the perfect interface layer between you and your agents, and we're going to take it from this to, well,

00:01:35.210 --> 00:01:37.450
this, a context aware system.

00:01:37.835 --> 00:01:38.875
Let me show you.

00:01:44.235 --> 00:01:52.750
Let's first get the thing to try out before we dive deeper into what we can do, but more importantly, how to do it. If you've upgraded Obsidian recently,

00:01:52.750 --> 00:01:56.190
you have noticed this, an Obsidian command line interface

00:01:56.350 --> 00:02:11.205
where you can pretty much do anything you like with Obsidian through the terminal. Obviously, the point isn't regular note taking, but rather scripting automation and integrating with tools. You'd want the latest Obsidian installer downloaded and updated to the latest release.

00:02:11.285 --> 00:02:23.320
Next, head over to your settings or command comma for Mac users where you'll now see a command line interface activation on the bottom. This button is helpful, but I can tell you it doesn't always do the full job,

00:02:23.640 --> 00:02:31.880
and you may head overexcited to your terminal just to see Obsidian command is not found. Not to worry. The CLI is now under the Obsidian app path,

00:02:32.285 --> 00:02:37.245
but an executable called Obsidian hyphen CLI, which you can symlink to local bin.

00:02:37.485 --> 00:02:40.445
That's it. Pop it open. Oh, and make sure Obsidian's

00:02:40.445 --> 00:02:50.450
open at the same time to see the same vault welcoming you. It's not fancy. It does look like they've used charm to build it and it's a list of commands that you can find and use.

00:02:50.770 --> 00:03:05.205
You can start simple and create a daily note with thoughts, journaling, and tasks if you use these. The syntax alone here already feels like it's not intuitive or user friendly, if I may. You have to set sub commands and add parameters like it's a curl request.

00:03:05.285 --> 00:03:07.845
Nevertheless, new note from the CLI.

00:03:07.845 --> 00:03:09.125
You can read it directly,

00:03:09.205 --> 00:03:12.885
and if you pop Obsidian, you'll see the same note waiting for you opened.

00:03:13.310 --> 00:03:20.350
You can keep appending content to your note like so and the syntax would get translated to proper markdown like Obsidian's

00:03:20.350 --> 00:03:22.270
to do slash done lists.

00:03:22.510 --> 00:03:45.710
But writing notes from the CLI is not very exciting. Sure. It's very helpful for the agent which will soon connect, but let's talk about searching. If you search for something like meeting notes and it finds a bunch, some not exactly what you'd expect from this kind of a search term. Here's why. Before the CLI, your agent searches like this with grep, rip grep or even worse, just scanning files manually.

00:03:45.950 --> 00:03:51.775
That gives you raw matches, no structure, no ranking unless you built it, and of course, no awareness,

00:03:51.855 --> 00:03:53.695
non Obsidian semantics.

00:03:53.775 --> 00:03:59.615
So fast but dumb. With the CLI, instead of here are 200 lines that match Kubernetes,

00:03:59.855 --> 00:04:16.460
you get something closer to no titles, relevant matches. That means your agent can pick pick notes, not lines. Obsidian search isn't just text search. It understands your notes tags, the path in the vault, your front matter fields, the entries on top of your notes. And lastly, probably the most important bit of everything,

00:04:16.620 --> 00:04:17.980
understanding links.

00:04:18.220 --> 00:04:30.215
Now if you actually want to read one of these findings, you'll be very disappointed to find this UX was not made for you. I mean, you'd expect at the very least that the results are fed into a fuzzy list, but nope.

00:04:30.295 --> 00:04:38.930
Not even auto completion when you do get the name right. Literally provide the full name or path to use. You can view it directly or have it pop up in Obsidian.

00:04:39.250 --> 00:04:52.495
One thing I keep noticing with AI tooling is that people reach for bigger models before fixing the surrounding system. If your agent has weak memory pool retrieval or no use for grounding, a larger model usually just gives you more expensive confusion.

00:04:52.735 --> 00:05:12.940
I was reading through Oracle's developer resources and two things stood out. One was getting more out of your smaller language models. The other was agent memory and why so many agents lose context once a task gets a bit longer. That feels much closer to the real engineering problem than generic AI hype. If you're building internal tools, Copilot, or automations,

00:05:12.940 --> 00:06:04.960
the real leverage usually comes from better context, better memory, and better data flow, not just swapping in a larger model and hoping for the best. If you want to dig into that side of the stack, Oracle has a developer resources page with articles, code, and examples worth browsing. Link in the description. Thank me later. And now back to the video. Some commands just shorten the path for automation. So if your tasks are built into daily notes and they should, we can talk about that another time, you can get them very quickly like so. These are daily tasks. If you just go tasks, well, you'll see every single task in your system. You can filter by file. You can actually get the ones that are done or in to do or any status you've set. A morning automation script can open the daily note then add a to do line to check your notes inbox, which is a critical step in the power method if you're following the second brain structure.

00:06:05.200 --> 00:06:08.800
And you could also tell your procedure to follow-up on unresolved items.

00:06:09.265 --> 00:06:36.560
Now this is where intent starts showing. You can find links and backlinks for notes, which basically gives you the mental model of a graph to work with, allowing an automation that collects data to search but also forge knowledge. Hear me out. Links are part of what makes note taking in Obsidian, Notion, and other similar systems so good. You can use them to break down notes or simply link to other relevant written pieces. Implementing the para method in Obsidian or Notion

00:06:36.805 --> 00:06:43.365
relies heavily on these links. You link a resource to a project or an area and maybe another resource.

00:06:43.525 --> 00:06:54.900
You've made the connection because it makes sense. You can now hand over that information to your automation or AI or what have you. When you ask the information, the context is not only bigger, it's smarter.

00:06:54.980 --> 00:07:14.215
When you create knowledge, bringing up the Carpathi post again, you let the agent give you context. Starting to get it? There are some fun ones too, like a random read note that you can have your personal assistant fetch for you every morning or just add it to the morning script. This is a real way to slowly recall old yet relevant notes over time.

00:07:14.535 --> 00:07:19.450
But if there's one thing that LLMs absolutely suck at is understanding

00:07:19.450 --> 00:07:26.570
visual concepts. If you're debugging or building a plugin or just want to fix workflows in Obsidian using an agent through the CLI,

00:07:26.905 --> 00:07:32.425
its API is exposing a screenshot that you can take on demand exposing the full UI.

00:07:32.505 --> 00:07:39.785
Open notes, tabs, menus, whether collapsed or expanded, adding context to the flow you're trying to achieve with an agent running underneath.

00:07:40.340 --> 00:08:14.530
Now if for whatever reason you are picking up notes on your own as a user through the terminal, I've recently shared a video around TV, a fuzzy searchable TUI where you can just pipe over your Obsidian files and enjoy a quick retrieval from the comfort of your terminal. My god. That was a nerdy sentence. I need to put that on a t shirt. TV also has channels where you can configure the preview and different actions with key bindings, like you'd probably want a key to pop a note to Neovim and another to open it on Obsidian or other actions that you can take with the CLI like tagging, aliasing, etcetera. More about TV in the video on the channel.

00:08:14.850 --> 00:08:27.855
Now we've been dancing around the CLI, it's time to actually let the agent use it. Another video I recently made was around PY, a lean agent I've been falling in love with recently. So I'll head over to its packages repo and grab pi obsidian,

00:08:27.935 --> 00:08:38.610
a small extension that adds a skill for the obsidian CLI. You can install it with pi install command or b me and let it figure out stuff on its own, which is an incredible inefficient token usage.

00:08:38.930 --> 00:09:19.360
And just like that, it's now on top of the CLI. By the way, one of the beautiful things about Pi is its ability to change itself according to your needs. So earlier, I asked it to teach itself about the Obsidian CLI. After installing extension package, I actually asked it to compare the two solutions, and until it gives us the result, we can use Py's on demand inline session to query my Obsidian Vault already. Now mind you, this took twenty seconds, not ideal, but in my defense, I am running highest reasoning model here, which may force its hands trying to overkill a simple task. Regardless, the note is opened and PY realized it had done a better job itself by wrapping around the CLI, so it kept its own thing and removed the extension.

00:09:19.680 --> 00:09:32.655
The CLIs in our tool belts and the agents. You can script around it, type results to your favorite TUIs, and then join notes through the terminal. But we've mentioned tokens usage a lot earlier, both the fact that the CLI should improve the consumption, but also

00:09:32.815 --> 00:09:36.930
the quality of usage. Just scanning files is fine, but not extremely efficient.

00:09:37.170 --> 00:09:51.745
The CLI improves capturing notes and improves the results. But if we take a quick look again at Karpathy's previous LLM post, we're now at stage three, q and a. We want to query the knowledge base and maximize both docking usage and quality of results.

00:09:51.905 --> 00:10:19.255
Someone actually went ahead and built it. Graphifi graphifi graphifi is a knowledge graph for coding assistance, and I'm pretty sure this was designed for code repos. So you can improve the questions on top of these, like how the author is implemented, where's the data layer, and so on. It uses tree seater and every context it can find to build a graph that should reduce up to 70 times token usage. Now that graph made me think of this immediately.

00:10:19.255 --> 00:10:22.855
So I thought what the hell, let's run it run on my notes fault.

00:10:23.095 --> 00:10:39.790
Ppex install graphifi with two y's and you can start working. If you run install, it'll create a Claude skill and instructions, but I'm not a Claude fan. It comes with installations for Gemini, cursor, codecs, OpenCode, Ader if someone still uses that, Copilot, Versus Code, Claw Hermes.

00:10:39.950 --> 00:10:55.145
If an AWS has attempted an IDE, Kiro is here. Long list, no PIE. If I pick open code and check the instructions, they added a reference to Graphify out directory and a request to check the graph, report, and index, then update when necessary.

00:10:55.620 --> 00:11:24.920
If you're a PI user, someone took care of that for you and there's a GraphifyPy package ready that adds roughly the same workflow to PI. It'll use the same out path there, remind the agent to traverse it before any search, etcetera. PY install, give it a minute, and we're good to go. We now have graphifi and a matching skill because the extension ships with both. You can use the command from PI or a generic skill command. If you let it run with no instructions, it'll just print the commands itself and other options.

00:11:25.240 --> 00:11:59.030
But we want actual work. Let's build a graph, mister. And after a minute, get this. It actually stopped itself because it's past the intended size. Like I mentioned, this was made to provide answers to relatively small repos with a bunch of files in the system and forcing its hand into something massive. So let's do the entire vault, and I'll try to show you the result in hopes my Mac doesn't start smoking. Over five minutes later, we've got a report, a JSON, Wiki index, and other stuff. The report, while a bit much to go into, looks for what's described as god notes, the notes with most edges,

00:11:59.545 --> 00:12:36.715
most well connected. It also lists surprising connections like launching a course note that I have is semantically similar to what's DevOps. Then there are tags and many other indexes, cohesion calculations of found notes. But here's the fun part. Graph HTML opens this, and while it's beautiful, I don't think you'll learn anything by just watching it. What you can, however, do now is run explain a topic and the system would yield connections for that context. This is basically a reasoning layer that checks the graph and adds a touch of context. If you want proper answers to questions, you can query things like everything I know about Kubernetes.

00:12:36.715 --> 00:12:39.995
You'll get all the relevant nodes ready to compile a response.

00:12:40.235 --> 00:12:48.070
When you finally actually do it from the agent, you'll get response you're looking for properly structured for review, references, and everything you like.

00:12:48.390 --> 00:12:53.830
Lastly, while I'm not one to get excited about benchmarks, surely not from the tool we're benchmarking,

00:12:54.070 --> 00:13:02.325
Grafify comes with its own internal benchmarking tool that tries to analyze reduction of tokens per query based on the index and the graph it's generated.

00:13:02.565 --> 00:13:07.605
In our case, not the 70 x promised, but again, this is a knowledge base, not a repository.

00:13:07.765 --> 00:13:19.680
You be the judge. At this point, you might ask, why not use one of a million AI note taking tools that promise to organize your life, summarize your meetings, write your emails, and probably raise your kids if you upload enough PDFs?

00:13:19.840 --> 00:13:24.000
But I don't want another place. I want the place I already used to become scriptable.

00:13:24.265 --> 00:13:36.105
That's a very different thing. Obsidian doesn't force agents worldview on me. It doesn't say here's your AI workspace, here's your AI memories, here's your AI graph that only exists inside our subscription.

00:13:36.345 --> 00:13:41.650
It just says here are your markdown files. That's powerful because markdown is boring and boring wins.

00:13:41.890 --> 00:13:56.555
Boring means portable, it means diffable, it means I can use git, and I can edit in NeoVim. And if Obsidian disappears tomorrow, I still have my notes. Lastly, I must say, this does not make Obsidian a magical second brain that thinks for you. You can't outsource thinking

00:13:56.715 --> 00:14:19.270
yet. If you don't compile notes and read them, this isn't knowledge. Not only it's not cemented in your brain, it doesn't even get there. A large graph is great, but that's just a fancy way to waste even more token. And if you want the human side of this workflow, the Obsidian and Neovim setup is still one of my favorite videos I've ever made. If you want the agent side, watch OpenCode or Py videos next. Thank you for watching. I'll see you on the next one.