WEBVTT

00:00:00.080 --> 00:00:21.585
Max Hermes is the best AI agent that I am using right now. I'm gonna be completely honest with you. I'm running Hermes agent for about 90% less than what most people are paying for it. In the deployment, it literally takes ten seconds to set up. There's no server. There's no docker. There's no model wiring. And And the reason that this is such a big deal is because Hermes agent is the fastest growing AI agent of 2026.

00:00:21.665 --> 00:00:27.345
They have over a 150,000 GitHub stars in ten weeks. It's faster than LangChain, faster than AutoGPT.

00:00:27.345 --> 00:00:45.235
So half the people that I know and who try to set this up on x, they're never doing it right or they just give up. And what Max Hermes does is it just takes the exact same Hermes underneath. It's the same learning loop. It's the same memory system inside of it. It's the same skill library and it just wraps it into a cloud running on a model that costs 5%

00:00:45.235 --> 00:01:52.995
of what Cloud Opus costs per token. So it's the same agent. It's just way less friction, way less money. And in this video, I'm going to show you how to deploy it in ten seconds, how to connect it to your Gmail, how to run a workflow live, and how to save what it just did as a reusable skill so the agent gets every time that you use it. Now I do not care if you've never run an AI agent before a day in your life. By the end of this video, you will have your own twenty four seven AI employee running for about $19 a month. Alright. Now real quick before I open up anything, let me just set the table of why this is actually such a big deal. Because if you're anything like me, you've probably been bouncing between AI agents just trying to find the right one. So Claude Code, Codex, OpenClaw, Hermes agent, every couple weeks, there's a brand new one that's supposedly the best and people switch. Now I've used all of them, and most of you watching have probably tried at least two or three of them. But you notice the actual problem with that is that every time you switch, you're starting over from zero. So whatever you taught QuadCode about your business, your code base, your client process, your writing style, it's gone the second that you open Codex. And whatever Codex had learned about you, it's gone in OpenClaw.

00:01:52.995 --> 00:02:02.090
And the agent, it doesn't follow you. Now before anyone in the comments starts yelling at me, yes, these tools have memory. I know. ClawdCode has Clawd. Md

00:02:02.170 --> 00:02:12.545
and files and skills and hooks and ChatGPT has its memory feature. They all remember things, and I'm not saying that they are stateless. What I'm saying is it's memory you manage. So you write the MD,

00:02:12.625 --> 00:02:57.760
you decide what to remember, and you effectively are building the skills by hand. You rebuild the context every time you switch tools, and that's just manual memory. In Hermes, it's different. Hermes writes its own playbook just from doing the work. It watches the task succeed. It decides what's actually reusable, and then it saves it as a skill without you asking. And the agent, it compounds itself every time that you use it, and that's the actual gap. It's not memory versus no memory. It's manual memory versus autonomous memory. And honestly, once you watch this happen on screen for the first time, you probably can't unsee it. Alright. So let's open this thing up. So this is the MiniMax agent. Their team, they frame it as a twenty four seven all in one workspace. And honestly, the productivity layer, it is really mature from what I've experienced. So inside of this, we have the experts,

00:02:58.000 --> 00:03:03.600
we have different skills, we have the office, and we have tools and image and video generation,

00:03:03.920 --> 00:03:22.535
even some web tools here. Now, most platforms, they usually just ship like one of these features and mini max it has all of them in one place, which is why I have been very fond of it. Next up, you see we have a few different options here. So we have Max Hermes and we have MaxClaw as well. So this is just the other flagship inside of the MiniMax agent. So MaxClaw is just the cloud version of OpenClaw,

00:03:22.535 --> 00:03:31.650
which is, of course, as you know, one of the most popular open source AI agents on GitHub right now. They have over a 100,000 stars. And the whole idea behind OpenClaw

00:03:31.650 --> 00:03:46.225
is that your agent, it lives inside of your chat applications. If you live under rock and you don't already know what OpenClaw is. So different channels like Telegram, WhatsApp, Slack, Discord, instead of you just having to open ChatGPT in a separate tab every single time, you can just text it like a contact,

00:03:46.385 --> 00:03:59.360
it text back, and it has persistent memory across every conversation that you actually have with it. Now what most people do when they are first setting up OpenClaw or Hermes is they're going to be running this locally. Now the underlying issue with running this locally

00:03:59.520 --> 00:04:08.935
is you're going to have to set up the local gateways, the channel connectors, the model wiring, and even doing the memory storing. So it's just a whole setup project

00:04:09.015 --> 00:04:11.415
just to get it running. Now MaxClaw,

00:04:11.495 --> 00:04:12.375
MaxHermes,

00:04:12.375 --> 00:04:20.670
it fixes all of that. So it's quite literally a ten second setup. There's no Docker, there's no servers, and it's just one click to actually connect your messaging.

00:04:20.990 --> 00:04:48.600
And the agent, it lives inside of your inbox. So I'm not gonna go deep on MaxClaw today because Max Hermes, this is going to be the one that we are focusing on this video, and it genuinely is changing how I work. But if you've been wanting to run Open Claw and bounce stuff off of the self host as well, Max Claw is going to be your shortcut. But anyways, back to the flagship here. So again, Hermes, this is going to be the one that learns from how you work and this is what has been running for me for about two weeks at this point. The MaxHermi, it runs on a model called m 2.7.

00:04:48.600 --> 00:04:56.040
It's 30¢ per million input tokens. It's a dollar 20 per million output. Now if we compare that to Opus 4.7,

00:04:56.280 --> 00:04:57.720
which is $5

00:04:57.720 --> 00:04:59.725
per input, $25

00:04:59.725 --> 00:05:01.885
per output. So m 2.7,

00:05:01.965 --> 00:05:07.645
it is roughly 17 times cheaper on input and 21 times cheaper on the output per token.

00:05:07.805 --> 00:05:15.530
Now is m 2.7 the best model on the planet? I'll be completely honest. No. It's not. Opus 4.7, it still leads on the hardest coding benchmarks.

00:05:15.610 --> 00:05:16.890
So it's 64%

00:05:16.890 --> 00:05:17.930
on SWE

00:05:17.930 --> 00:05:21.770
bench pro versus m two point seven's 56.

00:05:21.850 --> 00:05:40.805
So Opus, it of course still wins the leaderboard. Opus 4.7, it's just the best model on the planet out there, but it's very expensive. You do not need it for 90% of the task that you are actually doing. So for what an agent actually does day to day, like reading inboxes, drafting emails, classifying threads, maybe even building some applications,

00:05:41.045 --> 00:05:47.310
or just automating some processes, you would not notice the difference between Opus 4.7 and m 2.7.

00:05:47.470 --> 00:05:57.355
So the only difference is your bill at the end of the month. And for an agent like Hermes that's constantly running tools and rereading its own skills, swapping Opus for m 2.7,

00:05:57.515 --> 00:06:21.140
it's not just any small savings. It's literally the difference between an agent that you can afford to leave running twenty four seven and one that you cannot. Let's go through the full workspace, the hero agent, and I'll get a model built for this exact use case, but let me show you how to deploy it first. By the way, for those of you again who do live under a rock and you're not familiar already with what Hermes is, it's just an open source agent built by a research lab called News Research.

00:06:21.635 --> 00:06:59.745
So the tagline that they actually have, it's just an agent that grows with you. And this is the project people points to when they argue about whether AI agents can autonomously learn, not just store the notes that you wrote them. So the pitch, it's pretty simple. You give it a task, it executes the task, and then it writes a little playbook just describing how it solved it. And the next time you give it a similar task, it pulls that playbook off the shelf and it runs even faster. So the agent, it's going to be getting sharper and sharper every time that you are using it. Now this all sounds great, but the problem is just running it yourself. So you need Python setup, you need API keys for whichever models that you are plugging in. And if you want it running as a real service instead of just a terminal application,

00:07:00.065 --> 00:07:03.345
well then, you then have to just spin up Docker or VPS

00:07:03.345 --> 00:08:19.395
and have the people I know who looked at running it, they just bounced before they got a single working task up and running. And they just don't have a practical and successful system and they usually just go back to whatever they were using before, which isn't as efficient. This is the problem that Max Hermes solves for along with the pricing problem. Back inside the platform, I'll also have a link down below in the description to sign up. Also, you'll have the full guide inside of my free school community, so make sure to check that out if you're not already in there. So we'll just activate the Sandbox instance. Gonna start now and we'll click on confirm and pay. We just have to activate our instance. So we do have to have a base amount of tokens actually already in there. And just like that, in maybe two seconds, we already have Max Hermes set up. So again, with all of this, this is literally the same Hermes underneath. This is the same skill creation loop. It's the same memory system, the same idea. Now that we have our fresh chat ready to go, we have our skills section on the left hand side. This is just where any playbooks that the agent builds will live. So the current processes will be on the right hand side. If we just open this up, of course, we don't have anything up and running just yet because it's brand new. But this is where you'll watch the agents just work in real time once we do give it something to do. But let me just go ahead and make this actually useful. I have to give it access to my tools. So to actually do that, we're going to be using something called MCP,

00:08:19.395 --> 00:08:35.530
auto contacts protocol. This is just the open standard for agent to tool connections. So you can just point the agent at an MCP server, and the server handles all the actual tool calls. So for this video, to make it as simple as possible, I'm using Zapier's MCP server because it covers the broadest list of applications.

00:08:35.690 --> 00:08:41.130
So there's about over 9,000 applications and different integrations. So Gmail, Slack, Notion, Linear,

00:08:41.370 --> 00:08:57.755
HubSpot, I mean, Calendly, Stripe, mean, you name it. If you are using it, they most likely have it. So if you've got a Zapier login, you already have the connection layer for Max Hermes. So it's just a few different steps you have to actually go through. So to get started on that, I'm just gonna load up mcp.zapier.com

00:08:58.200 --> 00:08:59.480
and we're just going to

00:09:00.040 --> 00:09:20.845
log in to our account. Now what we can do is we just click on new MCP server. We could scroll down and just click on other. And there's gonna be a few tabs that we wanna focus on. So there's the applications and there is the connect tab as well. So apps, this is of course where you just wants to add all of your different applications. So you can literally search through all of the different apps they have. So I mean, you just think of a weird one, maybe it's, um,

00:09:21.165 --> 00:09:21.885
Jira.

00:09:22.445 --> 00:09:24.205
They have Jira on there or,

00:09:24.365 --> 00:09:38.140
um, HubSpot, of course, they have all the popular ones. And we can get a little bit more specific, so maybe go high level. I think they call it something different like lead connector. Yeah, just like that. Here's go high level. And then of course, we have all the other important ones. So maybe it's QuickBooks,

00:09:38.300 --> 00:09:40.460
check for that. We have QuickBooks and

00:09:41.020 --> 00:10:06.280
maybe another one we could throw in there. Let's think of Fireflies. There we have Fireflies. So what I'm going to do is I'm going to keep it pretty bare bones, pretty fundamental and we're just going to connect the most important tools. Now I wanna give it access to all these different permissions, all these different scopes. So I'm going to click select all tools and then connect and then just like that, we just have to log into our Gmail account. There's really no configuration we have to do there. Now from here, let's say we have all of our tools connected, we can just go into the connect section

00:10:06.520 --> 00:10:13.595
and this is where we have to a brand new token. So we can just copy the full URL with the token embedded.

00:10:13.595 --> 00:10:15.115
So let's grab this right now.

00:10:15.595 --> 00:10:16.795
Grab both of these.

00:10:17.195 --> 00:10:20.715
Actually, we'll go to option two and grab this full URL here.

00:10:21.570 --> 00:10:36.275
So this URL, it's just going to be the bridge between Max Hermes and Gmail. So we're going to feed it directly to the agent in just a second. But what's actually worth understanding is that we're not just connecting Gmail. We're connecting a server that can actually route to anything in this 9,000

00:10:36.275 --> 00:10:46.275
application library. So if you want Slack later, you just add it on this Zapier side. The URL, it's always going to stay the same. You just have to make the change on the actual

00:10:46.990 --> 00:10:54.990
application section. I've saved my credentials. You would just have to make the changes here. You don't have to get new URLs or anything like that. It's extremely simple to manage.

00:10:55.710 --> 00:11:06.585
Now back inside of MiniMax, we're just gonna go to the new task section. We first want to configure our MCP. So how we do this is we just click on this little settings section. We could go to manage MCP,

00:11:06.585 --> 00:11:18.630
and we either just connect what applications they have here. They only have like five. Well, a little bit more than five, but we click on custom beta. We'll just call this Zapier and we paste in our URL that we just copied from Zapier.

00:11:19.110 --> 00:12:40.995
We'll click on confirm and then we'll have to do one more thing. Now back inside of our Hermes agent, I can now type out connect to the Zapier MCP server to access Gmail tools. Once you are connected, pull every client onboarding email that I've sent in the last thirty days, find the ones where the prospect went cold, and after this second touch, draft personalized reengagement emails referencing what we last discussed and queue them as drafts in my Gmail. Now I have found at times that connecting to this MCP server, it does take a few iterations and back and forth with the agent itself, but, you know, after about five different chats, then you'll should be able to connect to it. It is just a little bit finicky. But anyways, here's what we got back. We have a real draft. All of this, of course, it's personalized to the actual conversation that I had with that prospect. It's now sitting inside of my drafts folder inside of Gmail and it's ready for me to review and then send off. So it's just one prompt. We have the MCP, we have the agent and the Gmail tools all working together now. Now this next part, this is what makes Max Hermes different from every other AI tool that I have used. So the agent, it of course, as you just saw, just did this task. But by default, the next time that I ask it to do something similar, it would start over from zero, unless you have actively built memory for it by hand, like writing a Clawdet MD or creating a custom skill. So watch what happens when I just tell it to remember this workflow. I'm gonna type out save this as a skill called Gmail called lead reengagement.

00:12:40.995 --> 00:12:49.840
Call it whatever you want. It doesn't really matter. But you can see here's the response that we're going to get back, and that is literally it. So the agent just wrote itself a playbook.

00:12:50.000 --> 00:13:31.965
So if we just navigate over here into the skill section, this is now sitting inside its library. So it's available the next time that I run a similar task. But the part that's actually wild is if you watch what agent actually says that it kept and what it deliberately left out. This is the part that no other chatbot is actually doing on its own. So the agent, if you actually go through this, it's reasoning about what is actually reusable in this workflow. So it's the structure, it's the search logic, it is the drafting steps, and the way that it queues things in Gmail as well, all of that is going into this skill. But the voice and the specific phrasing and the tone of the email itself, that is not going to go in because that would just make the next batch sound,

00:13:32.540 --> 00:13:54.465
I would say, stale. So the agent is not blindly saving everything I did. It's saving the parts that scale in. It's dropping the parts that should be fresh every time, and that is autonomous memory. And it's the reason that a $19 per month tool starts to feel like the best money you have ever spent on AI. So here's how to actually be thinking about this high level. So most AI tools, they have memory that you manage like claw.md,

00:13:54.625 --> 00:14:31.795
JHBT's memory list, cursors history, and you just decide what gets remembered. Right? Max Hermes, it has three different layers, and that's just Hermes in general, and you only manage one of them. So layer one, this is what you said. This is the chat history. So this is very standard. And layer two, it is the agent's reasoning over what actually worked. So which steps mattered, which API calls succeeded, what edge cases actually came up, and the agent, it records this on its own. Layer three, this is the skill itself. So this is the reusable playbook that the agent decided was worth saving with the parts that do not generalize stripped out. So this is also automatic.

00:14:32.070 --> 00:15:05.490
So after maybe a month of using this, you have gotten a library of skills written from how you actually work, and it's not just how MiniMax thinks you works. It's not how some prompt template thinks how you work. It's how you actually and genuinely work, and that library, it is going to be yours. Now a couple more things that I do want to touch on. So the skills we have briefly covered earlier in this video, but I mean, there's just hundreds and thousands of different skills that you can actually utilize. So you can go to a humanizer, writers. I mean, these are all just different things from GitHub. We have the flow diagram expert, which is going to help you create flowcharts.

00:15:05.570 --> 00:15:17.925
We have notebook l m. This is going to be able to query notebook l m notebooks directly from Claude code. I mean, there's so many different options. We have a Claude code harness. We have an SEO audit where it's going to analyze

00:15:18.005 --> 00:15:20.245
your crawlability indexation,

00:15:20.405 --> 00:15:22.005
speed on page optimization.

00:15:22.005 --> 00:15:36.440
I mean, so much different options. You can be providing these all into your Hermes agent, so it can just do more and more and more for you. Now the last piece that I wanted to cover before we actually wrap up is you do not have to be at a key for any of these tasks that you want to run automatically.

00:15:36.600 --> 00:15:39.305
So what I mean by that is hypothetically,

00:15:39.305 --> 00:16:01.780
every day or every Monday at 9AM, you want to scan new inbound leads from the last seven days and then qualify each one against my ICP filter. And then from there, like, can just queue the qualified ones as Gmail drafts with a personalized first line referencing their company. So we could type out in plain English every Monday at 9AM, scan new inbound leads, and basically what I just covered earlier. So I'm not gonna run through all of that.

00:16:02.180 --> 00:16:10.945
And then we just have to ensure that it's going to start running. So and then from there, it'll tell us that this is now going to run at Monday at 9AM

00:16:10.945 --> 00:17:03.796
every single week for us. But that is Max Hermes ten second cloud deploy. It is a learning loop that compounds every workflow that you teach it. You can schedule jobs that run on their own. They have a skill library that gets sharper every week that you're using it, and you can run on a model that's going to be 95% cheaper than Opus 4.7. Anyways, that's everything that I wanted to cover. So link to MiniMax agent will be in the description. It's completely free to sign up. You get 4,000 credits a day just for logging in. You don't have to even input your credit card, and it's $19 a month for the basic plan if you want more headroom. But if you want to sign up through my link, you will get bonus credits, so use that one. Also And make sure to check out our free school community if you want other frameworks and more videos on stuff exactly like this, how to be using AI more efficiently for your specific scenario, and making sure that you can actually get AI up and running as easy as possible. So make sure to check that out. Link will be down below in the description as well. Thank you guys for watching. I'll see you in the next video.