The bait, then the rug-pull.
Jack Roberts opens with a blunt provocation: 99% of people do not know what they are leaving on the table. Then he spends 21 minutes proving it — showing how a three-model triad running overnight through OpenRouter delivers near-frontier AI work at a price so low you can afford to retry it a hundred times.
What the video promised.
stated at 00:07 "I am gonna show you exactly how to connect Hermes with the most powerful models, from $0, no limitations or rate limits." delivered at 12:35
Where the time goes.
01 · Software costs less than minimum wage
Cost reframe: AI is now cheaper than a junior dev. Hermes vs Claude Code distinction.
02 · DeepSeek V4 pricing advantage
100x cheaper than frontier models. $75/M tokens vs $0.87/M. 95% of performance. Benchmark comparison.
03 · OpenRouter as the single key
One API key unlocks all models with usage tracking. Introduces multi-brain model system concept.
04 · Multi-brain model system
ChatGPT $20 sub for GPT-5.5, Gemini CLI free. Live demo of Gemini analyzing a YouTube channel visually.
05 · Six OpenRouter features most people miss
Nitro, Exacto, openrouter/auto, BYOK, Fallbacks, Zero-completion.
06 · The Triad framework
Plan (Opus 4.7) + Execute (DeepSeek V4 overnight) + Critique (GPT-5.5). Three models, one verdict, no brain isolation.
07 · The Pantheon and Orpheus persona
Hermes dashboard for visually building specialist personas. Creates Orpheus: the deep-work triad persona.
08 · Connecting OpenRouter to Hermes
Terminal: hermes setup model, select OpenRouter, enter API key. BYOK setup for DeepSeek.
09 · Soul.md — feed Hermes who you are
Identity, mission, goals, key metrics, communication style. The more context Hermes has, the smarter every task.
10 · Live Orpheus demo — niche analysis
Which Texas local service niche for AI/web services? Triad surfaces fire/water/mold restoration as top pick.
11 · Wrap and CTA
Hermes + DeepSeek = agent that grows with you. Next video teased on maximizing Hermes potential.
Visual structure at a glance.
Named ideas worth stealing.
The Triad
- Plan (Opus 4.7)
- Execute (DeepSeek V4)
- Critique (GPT-5.5)
Three-model AI loop: conductor plans, cheap worker grinds overnight, critic tears apart until shippable.
The Pantheon
Named specialist personas in Hermes each wired to a specific model mix.
Soul.md
Context document feeding Hermes identity, goals, business details, metrics, communication style.
OpenRouter modifiers
- :nitro
- :exacto
- openrouter/auto
- BYOK
- Fallbacks
- Zero-completion
Six string modifiers appended to any model name to change routing/reliability/cost behavior.
Lines you could clip.
"Would you pay 1% of the price for 95% of the value?"
"Software now costs less than minimum wage."
"WD-40 was the fortieth version that actually worked, hence the name."
"If you just ask Claude directly, I have found it just agrees with you for no reason."
How they spent the runtime.
Things they pointed at.
How they asked for the click.
"how to get Hermes to its maximum potential, which we are gonna learn in this video right here"
Soft next-video CTA only — no subscribe ask, no product pitch. Clean and low-friction.
Word for word.
Steal the triad.
Let the cheap model do the overnight grinding — Opus sets the strategy, DeepSeek does the work, a critic closes the loop.
- Set up OpenRouter as your single API key — one key, every model, usage dashboard included.
- Wire DeepSeek V4 as the worker model for any task that can run overnight: research, analysis, code review, content outlines.
- Always add a critic pass before shipping — single-model sycophancy is real, multi-model critique breaks it.
- Build a Soul.md or equivalent context file so every agent task starts with full business context.
- Use :exacto suffix on any model doing tool calls — not all models are certified, and agentic systems break on bad tool calls.
- The triad scales: swap any model in any slot depending on cost vs quality tradeoffs.
How to make AI actually useful for your decisions.
Stop asking one AI one question — run your decision through three different models with different roles and you will get an answer worth acting on.
- Ask an intelligent model to break down your problem and write the brief first, before you ask for answers.
- Use a cheap model to do the heavy research or analysis — you do not need to pay top dollar for every step.
- Always run a critic pass: give a different model the output and ask it to tear it apart before you trust the result.
- Feed your AI assistant context about who you are, your goals, and constraints — vague prompts get vague answers.
- Use OpenRouter as a single entry point so you can switch models without managing multiple API accounts.




































































