WEBVTT

00:00:00.000 --> 00:00:22.965
So this is Hermes Agent, the open source AI agent that lives on your computer and runs your day for you. And in the last thirty days, it just had the biggest update run since launch. Now I have been running it nonstop on my own machine for the last month. I was able to pull a six week old sponsor conversation in under a second when I needed to follow up. It ranked my whole sponsor inbox in the background while I was filming yesterday,

00:00:22.965 --> 00:01:46.825
and honestly, I have not even touched my email manually in the last two weeks, and half of what I'm about to show you in this video, it was not even possible thirty days ago. Now, here's why this actually matters, because Hermes, it is moving so fast right now that even people running it every single day, they are missing half of the updates. So by the end of this video, you're going to know the nine that actually move the needle, not just flashy demos. I'm going to be giving you guys the ones that quietly will save you hours and money, of course. So here's just a quick taste of what we're going to be getting into. There's one specific update that finally fixes the thing that every AI agent on the planet gets wrong, and there's one that actually lets you swap models mid conversation without losing a word of context. And the last one alone, it could save you a few hundred dollars a month. So there's nine total, six more that I haven't even hinted at yet, and we're going to be getting through all of them right now. And by the way, you do not need to be a developer for any of this whatsoever. You don't need to know what FTS5 means. If you can type a slash command and copy and paste and read a Telegram message, every one of these is going to be working for you, but let's get into it. By the way, I'll also have all of this stuff available in my free Skool community. We've almost got 20,000 members, so make sure to check that out if you want all this stuff step by step and more news right away. Hey, really quick, I just wanna mention on June 3 at 7 PM Eastern, I'm doing a free live webinar. And basically, what I'm gonna do is I'm just going to walk you through the four AI agency offers that are actually working right now that either my agency

00:01:46.905 --> 00:02:27.230
or my students are actively selling. Now the whole point of this session is you watch it, you figure out which one of these four is going to be fitting you, and then you just go land your first client. It's completely free. It's live. I'm not recording it, so if you don't show up, you completely miss it. And if you do show up live, I'm going to be giving you this thing that I made called the AI offer selection scorecard. It's basically how I would pick the right offer if I was starting from scratch today. So the link is down below in the description. Just click it, grab your spot, and I'll see you on June 3, but let's get back into this. So this first update, it is slash goal, and in my opinion, this is the biggest one in the entire release. Now every agent that I've ever used, it had the same problem. So you give it a task, you have a few back and forth messages,

00:02:27.310 --> 00:03:20.635
and then by maybe message 10, it's completely forgotten what you had asked for. Hermes, it just completely fixed this. So watch this. I'm gonna set a goal here for the week. So I'm going to say slash goal, draft a YouTube script covering the latest Hermes Agent updates, plus three title options for the video, and design the thumbnail by Friday. So I'm gonna run this off. We can see up at the top that we are going to be getting this pinned banner. So this is going to be showing up every time we have new messaging and everything, so it's not going to be losing that specific goal. But I'm just gonna type out a simple message like, can you first break down what the updates actually are? So we see within our output, we're going to get this sort of banner that's going to be pinned at the top of our conversation. In Telegram, it's a little bit wonky, and it's going to be more applicable to the actual terminal if you're going to be using Hermes in this fashion. But like most of you, you're going to be playing inside of different channels like Telegram, so that's where I'm going to be showcasing.

00:03:20.875 --> 00:03:55.140
But anyways, you can see this is where it's actually getting a little bit interesting, is that Hermes, it is breaking this into subtasks on its own. So it's going through scripts, and titles, and thumbnails, and actually spun up a separate judge model in the background, and the judge, it isn't the agent actually doing the work. It's actually a second agent whose only job is to watch the progress and decide if I'm actually getting closer to the goal, or if it's just wandering off from that goal. Now right up here, you could see this is where I was asking it to first just break down what the updates actually are. So I just took a pause in the middle of its output,

00:03:55.220 --> 00:04:03.380
in the middle of its, you know, goal run, and it was able to provide me with all that information, giving all the updates on, you know, what actually was released.
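
NOTE
A minimal sketch of the goal loop described above: a worker proposes steps, and a separate judge decides whether each step moves toward the pinned goal. This is not how Hermes implements it; GoalRun, judge, and run_goal are hypothetical names, and the judge here is a toy string match standing in for a second model.
```python
from dataclasses import dataclass, field
from typing import Callable, List
@dataclass
class GoalRun:
    """A pinned goal plus the subtasks still open (hypothetical structure)."""
    goal: str
    pending: List[str]
    done: List[str] = field(default_factory=list)
def judge(run: GoalRun, step: str) -> bool:
    """Toy judge: a step counts as progress only if it matches a pending subtask."""
    return step in run.pending
def run_goal(run: GoalRun, worker: Callable[[GoalRun], str], max_turns: int = 10) -> GoalRun:
    """Keep asking the worker for a next step until every subtask is done."""
    for _ in range(max_turns):
        if not run.pending:
            break
        step = worker(run)
        if judge(run, step):
            run.pending.remove(step)
            run.done.append(step)
        # A rejected step is simply dropped, so the loop stays anchored on the goal.
    return run
# Stand-in worker: wanders off once, then works through the subtasks in order.
proposals = iter(["reorganize inbox", "script", "titles", "thumbnail"])
result = run_goal(
    GoalRun("Ship the video by Friday", ["script", "titles", "thumbnail"]),
    lambda run: next(proposals),
)
print(result.done, result.pending)
```
The off-topic step is ignored and the three subtasks still finish, which is the behavior the pinned goal is meant to guarantee.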

00:04:03.540 --> 00:04:04.420
And then

00:04:04.785 --> 00:04:13.425
up next, we can see I was simply typing out something like, okay. What's next? And it just picked right back up right into

00:04:13.425 --> 00:06:26.390
the middle of its goal. So all of this, this is effectively what they call the Ralph loop, so the Ralph Wiggum, as you guys may have remembered it. This is just where the agent locks on the target across every single turn until you actually clear it. If I ever want to see what the judge is actually doing, I could just type out slash goal status. And we could see, because it already finished up, no active goal. We can set one with the slash goal, but it didn't just finish that. If we wanna do something a little bit different, we could say slash sub goal, make the thumbnail dark mode with one face, no text. So when it's actually generating the thumbnail, it's going to add that specific tweak to it. We can run this off. I'm probably not gonna get any crazy looking thumbnail or anything like that. Now the reason that I'm leading with this one specifically is because I had three videos to ship last week, and I kept getting pulled into sponsor email threads. But what I was able to do is just set a goal once on, you know, Sunday night. By Wednesday, Hermes was still nudging me about which videos were left. I didn't have to keep on telling it. And one thing to know if you are new to all of this, the goal that you write is what actually makes or breaks it. So saying something as simple as, like, build me an application, that is far too fundamental. It's far too broad, and you're just not passing enough information. So in that case, the judge, it has nothing to be checking against. But anyways, that is the first update for this command set, and you'll notice that Claude and OpenAI and all these other tools, OpenClaw, like they're releasing their own forms of slash goal. So this one, from what I have seen, is one of the best. Now number two, this is the memory upgrade. And honestly, this one is about three different things rolled into one. So your agent, it now is going to remember every single conversation that you have ever had with it. 
It indexes every tool call it's ever run, so you can ask what command it used, maybe last Thursday, and it will find it. And it also caches the whole thing across all the sessions, so you stop paying for the same context twice, which was a huge problem up till now. And the headline of all of this is the session recall. So let me show you what this actually looks like. So I'm gonna say pull our last conversation about our morning brief skill. So this is like way at the top of our conversation. We were talking about just setting up a morning brief skill, so let's run this off and see what we can get back with this. And this is being pretty brief, but anyways. So what it's doing right here, it's going through the session search, and it's just trying to recall the morning brief skill, morning whatever. So we're now getting our output back. It is pretty lengthy because it was a pretty lengthy conversation.

00:06:26.655 --> 00:06:34.495
Anyways, if we scroll all the way up to the top, we can see session search. It's doing a recall of the morning brief skill. It's trying to find exactly

00:06:34.655 --> 00:06:58.555
and, you know, just retrieving all the information. The session didn't find or return a stored prior thread, but the morning brief discussion is present in this current Telegram thread. I'm checking the live cron/scripts so I can pull the actual setup state, not just the chat recap. So it's going in pretty long depth, and it's able to find exactly what I had asked for in the conversation. We can see the output format that I was asking for. If we go down below,

00:06:59.035 --> 00:07:05.835
we can see what we noted. So the calendar plus Gmail, we're pulling real data, what is working, so everything that is set up, what was excluded,

00:07:06.235 --> 00:07:14.640
and some button logging. It's not fully wired, so this is exactly what I was talking about in that conversation. And we can get way more in-depth than this. Like I mentioned, this is pretty fundamental.
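
NOTE
A hedged sketch of what a session recall like the one above could look like, using SQLite's FTS5 full-text index from Python's standard library. The table layout and the recall helper are assumptions for illustration only; the video does not show how Hermes actually stores or searches sessions.
```python
import sqlite3
# Hypothetical schema: one row per stored message, searchable through FTS5.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE messages USING fts5(session_id UNINDEXED, body)")
conn.executemany(
    "INSERT INTO messages (session_id, body) VALUES (?, ?)",
    [
        ("s1", "Set up the morning brief skill with calendar and Gmail data"),
        ("s2", "Rank the sponsor inbox by deal size"),
        ("s3", "Tweak the morning brief output format"),
    ],
)
def recall(query: str) -> list:
    """Return the session ids whose messages match the query, best match first."""
    rows = conn.execute(
        "SELECT session_id FROM messages WHERE messages MATCH ? ORDER BY rank",
        (query,),
    ).fetchall()
    return [row[0] for row in rows]
# Double quotes inside the query make FTS5 treat the words as one phrase.
print(recall('"morning brief"'))
```
This needs a Python build whose SQLite includes FTS5, which is the common case but not guaranteed.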

00:07:14.800 --> 00:07:41.480
But if we're going to get something across a different session, we can absolutely do so because the memory, it is now effectively buffed. And one of the best things about this is it quite literally needs zero setup. So you do not have to do anything. As long as you are updated to the latest version, then you are all good to go. So the recall, the cache, the tool call history, all of it, it is going to be on by default. It just works. Now number three, this is slash background. So this is the one that finally fixed any sort of multitasking.

00:07:41.480 --> 00:08:25.705
So to give you some context, most agents, when you give them a task, they're going to be pretty busy. So you can't ask them anything else until they're going to be done. But with slash background, your agent can be working on five things at once, and still chat with you in the foreground like nothing is happening. So it's going to be somewhat similar to the slash btw inside of Claude Code that you may be familiar with. So watch this. I'm gonna fire three background tasks. So I'm first going to say, slash background, read the last 20 sponsor emails in my Gmail, and rank them by deal size, brand fit, and response urgency. Now I'm going to say, slash background, check the latest releases from Anthropic, OpenAI, xAI, DeepSeek, and summarize what shipped in the last seven days. And now lastly, what I'm saying is research the last seven days of YouTube videos from the top AI creators. Pull the titles, topics, thumbnails,

00:08:25.705 --> 00:09:04.440
view counts, and then identify the strongest emerging content patterns, repeated angles, and breakout trends that we can use for new video ideas, prioritize actionable insights over raw data. Boom. Now everything, it is currently running. It doesn't look like we have gotten a response back from any of them thus far. Totally fine. But whilst all of this is going on, I mean, I can just keep chatting in this foreground. So I can say whatever I want, and it's going to have all of these still processing, still running for me, all in the background. Alright. So just like that, we are getting our first task. So it's going to rank all of our sponsors. We could see, starting from the top, we have Base44, Asana, and then all these other people who are reaching out for sponsorships,

00:09:04.600 --> 00:09:46.315
so on and so forth. But whilst all this is going on, we have our other two that are currently running in the background as well. Now you'll notice that each one, they are going to get their own task ID, so this is just going to be used for referencing at different points. So if you ever forget or maybe some things are a little bit similar to one another, and you can't, you know, discern which one is which, you'll be able to identify them and reference them through this task ID if ever necessary. Alrighty. Now number four, this is going to be the auto Kanban. So Hermes, they have had this Kanban board for a while now, but what's new is you can drop a raw idea into Triage, and Hermes, it's going to be able to flesh it out into a full spec, break it into subtasks, and just assign them out to sub agents all on its own. To actually spin this up, we could just type hermes dashboard.
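
NOTE
The background-task behavior described above (each job gets its own task ID while the foreground stays free) can be sketched with Python's asyncio. This only illustrates the pattern; the background helper, the bg- ID format, and the job names are made up and are not Hermes internals.
```python
import asyncio
import itertools
_ids = itertools.count(1)
tasks = {}
async def job(name: str, seconds: float) -> str:
    """Stand-in for a long-running background job."""
    await asyncio.sleep(seconds)
    return f"{name}: done"
def background(name: str, seconds: float) -> str:
    """Start a job without waiting for it and return a task ID for later reference."""
    task_id = f"bg-{next(_ids)}"
    tasks[task_id] = asyncio.create_task(job(name, seconds))
    return task_id
async def main() -> dict:
    ids = [background("rank sponsors", 0.02), background("check releases", 0.01)]
    results = {"foreground": "still chatting"}  # the foreground is free while jobs run
    for task_id in ids:
        results[task_id] = await tasks[task_id]
    return results
results = asyncio.run(main())
print(results)
```
Looking a job up by its ID later is what lets similar-sounding tasks be told apart.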

00:09:46.315 --> 00:10:15.065
And then a few seconds later, it'll automatically populate the dashboard for you. And we can just navigate to the Kanban, and you'll notice there's going to be four different columns. So we have the triage, we have the to do, we have the scheduled, ready, and the thing that makes the magic happen, it is right here at the top. So the orchestration, it is currently set to auto. But watch this. I'm gonna drop one raw idea into the triage. So how you actually do this, if you just click on the triage, click this little plus button, you give it a rough idea,

00:10:15.305 --> 00:10:39.685
and you can actually expand this. Prep tomorrow's filming day. List every video I need to shoot. Draft a thumbnail concept for each one, and then write three title variants per video. We could provide it with some of the skills that we want to reference and utilize specifically just to make sure it's not going to mess anything up. But we're just going to click on create. And just like that, you could see it's now listed under this specific column, and we could click on open.

00:10:39.765 --> 00:10:42.645
We have all these different options, like specifying, decomposing,

00:10:42.645 --> 00:12:04.315
moving it to ready, like all the other columns. If we wanna move it into there, we can notify any of the home channels. And here, we have some of the dependencies and some of the children as well. Now automatically, it was just moved into the next column, so it's now moved into the to do. So the specifier inside of Hermes, it's grabbing that raw idea, and then fleshing it out into a real spec, and then it's just going to be breaking it apart. So now we can see some of the things that are in progress. So compile tomorrow's video shoot list. So it's broken that apart. It's now doing this first, and then it has some other tasks that it has to do later on, you know, just making sure that it's delegating and assigning everything by priority. So if it has to do this first, it's going to make sure to start that task first, and then go to the other tasks that it had already broken apart. So with this, we have tons of different sub agents, and they're all about to start working in parallel. Now my personal favorite use for this, it's video research, at least from how I've been using it thus far. So when I'm digging into a new topic for this channel, I drop a one line brief into this triage right here, something like just find everything that shipped in AI agents this week, pull the top 10 examples, and then rank them by the impact. And then Hermes, it's going to split all of them out into a bunch of different parallel research tasks, and send a sub agent at each individual one. By the time I'm back from filming, it's all going to be stacked up. It's going to be in this ready section all the way to

00:12:05.035 --> 00:12:17.680
wherever it is, ready right here. And then this right here, like, this used to take me a full afternoon, and now it's going to be done in the time that it takes me to shoot just one video. But, yeah, with all this, make sure your orchestration is going to be set to automatic,

00:12:17.920 --> 00:12:31.615
and that it's not flicked to manual. Very important. Alrighty. Now number five, this is going to be one of my favorites, computer use. So Hermes, they have had computer use before, but it only worked with Claude. And as of about v0.14,

00:12:32.020 --> 00:12:51.345
it works with every vision-capable model. So like GPT-5, which is what I'm using right now, Gemini, Grok Vision, literally all of them, which means whatever model you're already paying for, it can drive your screen for you. So how you actually set this up, just open up Hermes

00:12:51.345 --> 00:12:52.065
tools.

00:12:52.465 --> 00:15:58.015
And from here, it's going to automatically populate the tools section, and it'll automatically populate this little terminal tab right here. What this does is let you reconfigure an existing tools provider or API key, and just make sure that computer use is actually enabled. So you can press space and make sure that everything's good to go, and then we can scroll down to click on done, and then we'll just back out of this. Alright. So now check this out. What I'm gonna say is use my browser to open up ClickUp and find today's tasks for me. So I actually had to restart my terminal and give it access and the proper permissions to do all of this, but right now it's asking me to log in. How would you like to proceed? So this is where I would have to provide it with the credentials. I'm just gonna click on one, and we'll do all of this manually. So now we can see I'm not touching anything. It's going to automatically open up ClickUp for me, and it should navigate everything. So just like that, it's automatically navigating to my to do list. Now it's clicking on each individual task, so this is the first task. Slowly but surely navigating to one of my other tasks, which is booking in for a doctor and the dentist. But anyways, it's just going to navigate through all of this just like a normal person would. Now the reason that this is a bit bigger than it actually sounds is because I had ClickUp open at home, on the desk that I'm using right now, and I forgot to close out a task before I left to meet up with a friend. So from my phone, I was able to just text Hermes and tell it to mark it as done. And just like that, I didn't have to pull up my laptop or anything like that. It was able to control my computer and handle everything for me, use computer use, and, you know, open up the necessary tabs, close them out, and do anything else for me. Alrighty. Number six. 
So Hermes, they have this background agent called the Curator, and it's hands down one of the smartest things that they have shipped this entire release. Nobody's really talking about this. So what it does is every seven days, it goes into your skills folder, and it cleans the whole thing up. So it's doing some sort of self maintenance. You literally do not have to do anything. So watch this. I'm gonna pull up what it did the other week. So I'm gonna say hermes curator status, and just like that, we now have a ranked list of every skill that I have. So this is a brand new fresh Hermes instance, so we don't really have much. Well, actually, we only have one. But normally, you'll have the ones that you use every single day. They're going to be up at the top, and the ones at the bottom, they're going to be somewhat deadweight. So that's going to be stuff that, like, you totally forgot about. But now if we just check out down at the bottom, we can see that the curator, it is enabled. It's seeded, but it hasn't completed a real run yet, and it's now going to be running every seven days unless you manually preview something. So you don't really have to manage this. It's just gonna be running automatically. So for you and your use case, if you're using Hermes quite frequently, and you're using it over the course of weeks and months, in this case, it's just going to, you know, prune a bunch of the dead ones, and promote the other ones to the top all on its own, all while you're sleeping. Now number seven, this one is actually pretty awesome. This is the native video generation. So your Hermes agent, it can now go text to video or even photo to video all inside your Telegram or in your Hermes agent all natively. There's no popping over to a separate website, no signing up for another AI tool, and, you know, having to worry about the 50 others that you're already paying for. 
So all those different ads that you see on YouTube and Instagram, the ones that are like, sign up for our crazy AI video tool, only $50.90

00:15:58.015 --> 00:16:44.720
dollars a month, generate the most cinematic videos, whatever it may be. This is kind of the same thing, except it's just built straight into Hermes. So as long as you've got your Grok account hooked up, you can do it right from your chat. For me personally, I don't have Grok hooked up, so I'll show you better yet how to actually set all this up step by step. So I'm gonna say generate a five second clip of a robot bartender mixing a cocktail. Alright. Now number seven, this one's actually pretty awesome. This is the native video generation. So your Hermes agent, it can now go text to video or even photo to video all natively inside of Telegram or even inside of the Hermes terminal. So there's no popping over to any separate websites, no signing up for another AI tool on top of the 50 that you might already be paying for. So you might have seen, like, all those ads on YouTube, and the ones that are, like, sign up for our crazy AI video tool, only $97

00:16:44.720 --> 00:17:50.355
a month. Yeah. This is kinda the same thing, except it's just built into Hermes. So as long as you do have some sort of provider, so either if you're using Grok or something like fal, you just provide it with your API key. So I'm going to give it mine from fal. I have my API key at the top, but what I'm also saying is generate a five second clip of a robot bartender mixing a cocktail, and we'll run this off. So while this is running, because it will take a couple of minutes or so, this is gonna be great for any b roll for videos, or any intro stings. Just little clips for social media, or maybe any animations for your thumbnails, you know, the stuff that you used to either have to film yourself or just pay an editor to make, or you can just pay another AI $12.30 bucks a month. But now you can just type a prompt inside of the chat you're already using, and it'll be able to spin it up extremely simply and very cheaply. Alright. So we just got our video back. It took maybe about four minutes or so, but we have our five second clip natively inside of the chat, exactly what we're looking for. Now if you did want to use this through Grok, you would have to enable SuperGrok and have this connected because it is going to be running straight off of Grok. So how you can actually enable this yourself, if you did want to do it, you just have to open up hermes tools once again. And then you'll just have to open up and enable the video generation,

00:17:50.435 --> 00:18:04.880
and then sign up and sign in with your SuperGrok account once. That's all it takes. Takes literally like thirty seconds. After that, it'll work every time for you. You don't have to rely on different platforms or providers like fal.ai or Higgsfield or anything like that. Alright. Now number eight, this one is legitimately

00:18:04.880 --> 00:18:10.480
going to save you a lot of money if you are following my advice properly and using this practically.

00:18:10.640 --> 00:18:35.890
So with this, with slash model, you can just swap models mid conversation without losing a single word of context. You'll have the same chat, the same thread, just a different model answering, just like using Claude Code. Now the biggest reason this matters is because most of the time, you don't actually need Opus 4.7 or GPT 5.5 like I'm using right now, or whatever the most expensive model is. Most of the stuff that you are doing, it's probably just clean up, or formatting, or quick look ups, or like simple summaries.

00:18:35.970 --> 00:18:50.605
So tasks that a much cheaper model could handle just as well. So you're probably paying for premium models for stuff that a smaller model could do for a fraction of the cost, or literally zero if you're running models locally. So watch this. I have GPT 5.5

00:18:50.605 --> 00:18:54.845
running right now, and I'm just going to say slash model 5.4.

00:18:54.845 --> 00:18:59.910
And just like that, we were able to swap to GPT's lower and cheaper model, a 5.4.

00:18:59.910 --> 00:19:07.750
We could also go to any other models and go to completely different providers. So if we have different fallback options and everything connected inside of Hermes,

00:19:07.750 --> 00:19:09.270
maybe we have Anthropic,

00:19:09.270 --> 00:19:10.390
or maybe we have

00:19:11.110 --> 00:19:11.990
OpenRouter,

00:19:11.990 --> 00:19:18.785
or DeepSeek, all these different model providers. We can just easily select slash model, and we can just go to Anthropic,

00:19:18.785 --> 00:19:21.745
or we can go to something like DeepSeek,

00:19:21.825 --> 00:19:24.465
if I can spell properly, or even OpenRouter,

00:19:24.945 --> 00:19:40.320
so on and so forth. So this is where you can save a significant amount of money. And something you could also do is give instructions to Hermes and say something as simple as, I want you to determine which models are going to be the best model for the job. So in this case, if we have to go to a

00:19:40.480 --> 00:19:47.915
higher model in which we're doing a complex task, then automatically do slash model and switch to 5.5

00:19:47.915 --> 00:19:50.715
instead of relying on the lower tier models like 5.4.
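
NOTE
A minimal sketch of the routing idea described above: keep one conversation history, and pick a cheaper or pricier model per task. The model names, prices, and keyword rule are hypothetical placeholders, not real Hermes settings or real pricing.
```python
from dataclasses import dataclass, field
from typing import Dict, List
# Hypothetical tiers and relative prices; real model names and costs will differ.
MODELS: Dict[str, float] = {"cheap-model": 0.25, "premium-model": 5.00}
COMPLEX_HINTS = ("plan", "architecture", "debug", "strategy")
@dataclass
class Conversation:
    model: str = "premium-model"
    history: List[str] = field(default_factory=list)
    def switch(self, model: str) -> None:
        """Change the answering model; the history list is left untouched."""
        if model not in MODELS:
            raise ValueError(f"unknown model: {model}")
        self.model = model
def pick_model(task: str) -> str:
    """Toy router: use the premium tier only when the task text looks complex."""
    text = task.lower()
    return "premium-model" if any(hint in text for hint in COMPLEX_HINTS) else "cheap-model"
def send(chat: Conversation, task: str) -> str:
    """Route the task, record it in the shared history, and report the model used."""
    chat.switch(pick_model(task))
    chat.history.append(task)
    return chat.model
chat = Conversation()
used = [send(chat, "Format this list"), send(chat, "Plan the launch strategy")]
print(used, len(chat.history))
```
Because the history lives on the conversation rather than on the model, switching tiers does not drop any context.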

00:19:50.955 --> 00:20:14.425
Let me just type this out. Hermes will automatically be able to set this up for us. Alright. Now the last one, number nine, if you do any vibe coding at all, this is the one that's gonna save you the most money. So Hermes, right now, it can natively use Codex CLI as a worker, which means that Opus, it stays your main brain for the thinking and the planning, but the actual line by line coding, it gets handed off to Codex, and Codex runs on your ChatGPT subscription

00:20:14.425 --> 00:21:04.320
instead of your Anthropic API. It's a different bill, but the same workflow. So just a quick heads up. For this to work, you do need Codex CLI installed on your machine and signed into your ChatGPT account first. It takes about thirty seconds. I've already got mine set up, so I'm just gonna drop the prompt and the commands, and show you what actually happens. And the install steps are gonna be linked inside of our free Skool community if you need them, but you just have to install Codex through npm, and then just simply type out codex inside of the terminal. So again, check that out inside of our Skool community. So I'm back inside of Hermes, just using Telegram, of course. And I'm gonna say use Codex to build a one page landing site for an AI boot camp in a single HTML file, dark theme, three pricing tiers, and use Tailwind. So now I'm just going to press enter. So now Hermes, it sees the use Codex part, and it's going to route the work over.

00:21:04.640 --> 00:21:22.365
So it's going to just notice the skill view codex. Alright. So we just got this wrapped up, and let's now open this up and see what it actually populated just using codex. So just like that, we now have our landing page. We have the dark theme pricing section. Okay. So we have 1,800, 3,200, and 9,500,

00:21:22.365 --> 00:21:31.550
the whole thing. And the part that matters the most, the entire build just ran on the ChatGPT side of my stack. So my Anthropic bill, it didn't even move a millimeter.

00:21:31.630 --> 00:21:36.030
Now the way that I think about this is you've got two specialists on call.

00:21:36.270 --> 00:21:38.270
Opus, it should be your strategist.

00:21:38.575 --> 00:22:54.885
Codex, it should be your builder. And you don't even have to pick which one does what part. You just say use Codex when you want Hermes to delegate the build, and it routes the work for you. Anyways, guys, that is the nine. And if I had to pick the four that I think most people are actually sleeping on, it's gonna be the slash goal, the curator, the slash model, and the native Codex. So those four, they're going to quietly save you the most time and the most money. So make sure you set those ones up first. And one last thing, if you want every single one of these features, all you need is the latest Hermes setup. So if you haven't updated in a while, just run hermes update in your terminal. It takes literally a minute, and everything I just showed you in this video goes live. And the thing with all of this, I told you at the start that these updates, they quietly save you a bunch of time and a bunch of money, but watching me do this, it's one thing, and actually wiring it up to your own machine, it is another. So I built a community where we break every one of these down step by step, and we have 18,000 other people already in there building the same stuff. It's the first link in the description. It's completely free. Come drop a comment when you are in, and tell me which of the nine you set up first, because I genuinely want to know which one lands for people. And if even one thing in this video saved you time, just hit subscribe, drop a comment. Really appreciate it. But every single week, I will be posting videos just like this and a bunch of other things. So make sure to check that out. Check out the links down below in the description. Thank you guys for watching. I'll see you in the next video.