Cole Medin · Youtube · 28:10

Anthropic Just Dropped a Masterclass on Building Agent Harnesses (for Large Codebases)

Cole Medin turns Anthropic's high-level blog post into a working repo — seven concrete components for the AI Layer that wraps Claude Code.

Posted

May 21st 2026

today

Duration

28:10

Format

Tutorial

educational

Channel

CM

Cole Medin

§ 01 · The Hook

The bait, then the rug-pull.

Anthropic published a high-level playbook for making Claude Code work in massive codebases. It names seven components but never shows you one. Cole built the whole thing — and renamed Anthropic's "harness" to the AI Layer, the third leg of every codebase next to code and tests.

§ · Stated Promise

What the video promised.

stated at 01:00 "Not only are we covering the article, but we're also gonna see concrete examples of all the strategies in action." delivered at 26:19

§ · Chapters

Where the time goes.

00:00 – 01:04

01 · AI Coding in Large Codebases

Pattern-interrupt cold open: tutorials are a dime a dozen, but nobody covers large codebases. Names Anthropic's post and promises a demo repo + plugin.

01:04 – 01:57

02 · What We're Covering

Table of contents: seven AI-Layer components Cole will walk through. Each maps to one Anthropic strategy and one concrete demo.

01:57 – 03:00

03 · How Claude Code Navigates Today

Claude Code uses agentic search — grep + folder walking, no embedding index. The tradeoff: works best when starting context is curated.

03:00 – 04:19

04 · The AI Layer

The thesis: the harness matters as much as the model. Codebase now has three parts — code, tests, AI Layer (CLAUDE.md hierarchy, hooks, skills, plugins, LSP, MCP, subagents).

04:19 – 08:40

05 · Lean & Layered Global Rules

Keep CLAUDE.md short — long rule files degrade performance. Use subdirectory CLAUDE.md files that load progressively as Claude walks into folders. You can also init Claude inside a subdirectory to scope the working tree.

08:40 – 10:31

06 · Sponsor: JetBrains Academy

Mid-roll sponsor read for JetBrains Academy AWS skill paths — learn in PyCharm, deploy in prepaid AWS sandboxes.

10:31 – 15:22

07 · Self-Improving Hooks

Hooks aren't just guardrails. A Stop hook can run a separate headless Claude session at end-of-turn to inspect the diff and propose CLAUDE.md updates while context is fresh. A SessionStart hook can pull per-team context (git state, Confluence docs).

15:22 – 18:16

08 · Path-Scoped Skills

The skills parameter most people miss: paths. Skills only activate when Claude touches matching files. Demo: api-add-route skill scoped to services/api/**. Clean mental model — global rules are conventions you must follow; skills are workflows you sometimes run.

18:16 – 21:57

09 · LSP & MCP for Symbol Search

Wrap a language server as an MCP so Claude can search by symbol (definition / references) instead of grepping strings. Critical once a repo passes ~100K LOC where grep gets slow and token-inefficient.

21:57 – 23:56

10 · Subagents for Exploration

Split exploration from editing. Send research/web/codebase-map tasks to subagents with their own context windows — the primary session keeps a clean context for the actual edits.

23:56 – 26:19

11 · Claude Plugin & Getting Started

Bundles every component into one install: /plugin marketplace add <path> then /plugin install helpline-ai-layer@helpline-tooling. Ships stop hook, explorer subagent, codebase-search MCP, and an example scoped skill.

26:19 – 28:10

12 · AI Layer Ownership

Anthropic's closing advice: identify a small champion team (or a hybrid PM/engineer) to build the AI Layer in a quiet investment period before rolling out org-wide. Cole pitches enterprise training, asks for the like + sub.

§ · Storyboard

Visual structure at a glance.

open — Anthropic post

hook open — Anthropic post 00:00

TOC — what we're covering

promise TOC — what we're covering 01:05

AI Layer diagram

value AI Layer diagram 03:20

harness diagram

value harness diagram 05:27

sporadic vs always-on

value sporadic vs always-on 11:53

stop hook running headless

value stop hook running headless 13:54

CLAUDE.md review markdown

value CLAUDE.md review markdown 15:38

path-scoped skill (skill.md)

value path-scoped skill (skill.md) 17:25

MCP slash command + prompt

value MCP slash command + prompt 20:14

plugin install in terminal

value plugin install in terminal 25:10

three phases of rollout

cta three phases of rollout 27:17

§ · Frameworks

Named ideas worth stealing.

03:00 list

The AI Layer (7 components)

CLAUDE.md hierarchy (lean & layered)
Hooks (self-improving)
Skills (path-scoped, progressive disclosure)
Plugins (bundle distribution)
LSP (symbol-level navigation)
MCP servers (extension)
Subagents (exploration + editing split)

Cole's rename of Anthropic's 'harness' — the third leg of a codebase next to code and tests. Every component maps 1:1 to an Anthropic strategy.

Steal for any AI-coding skill or product positioning

17:50 concept

Rules vs Workflows

Clean distinction for the perennial 'is it a CLAUDE.md or a skill?' question. Global rules are conventions you MUST follow; skills are workflows you sometimes RUN. Same scoping mechanic, different purpose.

Steal for any internal taxonomy doc where two tools overlap

04:30 concept

Sporadic vs Always-On

Anthropic's diagram shows CLAUDE.md as the only always-on component; everything else fires sporadically. Justifies ruthless trimming of CLAUDE.md and aggressive scoping of everything else.

Steal for context-budget framing in any prompt-engineering content

26:05 list

Three Phases of Claude Code Rollout

Quiet investment (champion team builds the AI Layer)
First rollout lands (early adopters)
Adoption spreads (org-wide standards)

Anthropic's org-adoption advice — small team builds the harness in quiet before rollout, to avoid both 'disappointed on day one' and 'everyone evolving their own separate AI Layers'.

Steal for any enterprise-AI-adoption sales deck

§ · Quotables

Lines you could clip.

00:22

"Claude and AI coding tutorials are a dime a dozen, but what people are not really covering nearly enough is how to use these tools to work in large codebases."

tight hook, identifies a market gap in 1 line → TikTok hook

03:16

"The harness matters as much as the model."

thesis line, repeated 3x in the video — the whole essay collapses to this → IG reel cold open

03:50

"I like to call it the AI Layer. I think that's more descriptive."

the renaming move — Cole's framework lives here → newsletter pull-quote

10:42

"Most teams think of hooks as scripts that prevent Claude from doing something wrong. But their more valuable use is continuous improvement."

reframe of a misused primitive → TikTok hook

17:43

"Global rules are your conventions. Your skills are the workflows."

perfect distinction phrase, fits in a tweet → newsletter pull-quote

21:02

"Once you get like into the six digits for lines of code, you need something like this because grep by itself is gonna be slow and really token inefficient."

concrete threshold — gives the LSP/MCP advice a trigger → newsletter pull-quote

§ · Pacing

How they spent the runtime.

Hook length22s

Info densityhigh

Filler8%

Sponsor blocks

08:40 – 10:31 · JetBrains Academy

§ · Resources Mentioned

Things they pointed at.

01:28linkAnthropic — How Claude Code works in large codebases ↗

24:06linkcoleam00/helpline (the demo repo) ↗

08:40linkJetBrains Academy AWS Skill Paths ↗

10:15linkDynamous Second Brain Bootcamp ↗

§ · CTA Breakdown

How they asked for the click.

27:40 subscribe

"If you appreciate this video and you're looking forward to more things on AI coding and Claude code, I would really appreciate a like and a subscribe."

Soft. Real CTA is buried — 'I do offer enterprise trainings... got my email in my bio' lands at ~26:30 before the like+sub close. No mid-roll CTA. Two soft hooks for the GitHub repo throughout.

§ 04 · The Script

Word for word.

HOOK opening / re-engagementCTA the pitch metaphor

00:00HOOKClaude and AI coding tutorials are a dime a dozen these days, but what people are not really covering nearly enough is how to use these tools to work in large code bases. That's what I wanna cover with you right now because you probably already have a complex code base or two or three. You've got the apps and platforms that you're building, your second brain.

00:21HOOKYou have these code bases that are tens or even hundreds of thousands of lines long, and it could be tough to get these coding agents to navigate those larger code bases and work in them effectively. And even if you don't have a complex code base yet, you'll get there, my friend.

00:36HOOKYou start with a simple idea, a simple code base, but as you evolve that code base, the coding agent strategies that worked before, they fall flat on their face. That's why I'm excited to get into this. Synthropic put out this article just a few days ago, how do you use Claude code to work in large code bases.

00:54HOOKAnd really these ideas apply no matter the coding agent that you are using. And there's a lot of gold in this blog post. So I wanna get into all of this.

01:02They do stay pretty high level in the blog post though. And so I also took all of their strategies and I built them into a demo code base for this video.

01:11So not only are we covering the article, but we're also gonna see concrete examples of all the strategies in action. And I even have a Claude plugin

01:20that makes it super easy for you in two commands to take a lot of the strategies that we're covering here and immediately bring them into any code base that you are working on. And so we'll get to that, but I wanna start pretty high level, share these strategies, and and let's see them in action as well. So Anthropic starts by talking about all of the pretty impressive places where Claude code is currently being used at an enterprise level across multi million line model repos, decades old legacy systems, distributed architecture spanning dozens of repositories.

01:50Basically, they're just making the point here that if you think your code base is too complex for Cloud Code, you are wrong. And then they go into how Cloud Code navigates a code base, at least before we have more of an AI layer. So the tool out of the box, it uses something called a gentic search.

02:08So we're not performing traditional rag or semantic search. There's no code base indexing with Claude code. Instead, it's going to navigate a code base more as an engineer would with command line tools like grep, just looking at the folder structure, using

02:23all of the command line tools at its disposal to identify the places for especially a larger code base to pay attention to and where it needs to edit. And so this is really powerful because then there's no index that you have to keep in sync, but the trade off is that Claude works best when it has enough starting context to know where to look.

02:42And so this really gets us into a lot of the strategies that we'll cover here. It's all about how do we curate that context upfront so Claude can navigate a more complex code base effectively, knowing where to actually look based on the request that we have for it.

02:57So that then brings us to the main point of this blog post that really sets the stage for all the strategies. The harness matters as much as the model. A lot of people get really hyper fixated on model benchmarks and they think that tools like Claude Coding Codex,

03:12the power really comes from how good the underlying large language model is. And yes, that matters, but honestly what matters even more is the ecosystem built around the model, the harness. And I like to call it the AI layer.

03:24I think that's more descriptive. It's really everything that they lay out right here with quite a few paragraphs. I also have a nice diagram to make this even simpler.

03:32The AI layer is the set of context and tools that you give your coding agent to work on a code base. And so traditionally, a code base would have two main parts.

03:41It would have the code and then it would have the tests. And so now with the AI layer, we have a third component of every code base introduced.

03:49This is everything like your global rules, your skills, your MCP servers and sub agents. Really every single individual feature of Claude code that gives tools or context, that is a part of your AI layer. And so there's seven things that we have here.

04:05Couple you might not be as familiar with like LSP and hooks, but we'll talk about all of that because really each of these seven map to one of the strategies that Claude code covers. This is where I have a concrete example for each of them. So let's get into this.

04:19So the initial strategies that Anthropic covers are all about making it as easy as possible for Claude code to navigate your code base at scale. And a lot of it centers around the first and maybe even most important part of your entire AI layer, which is your global rules. So take a look at this.

04:37They have this visual representation for how often the each part of the AI layer is used throughout a Claude code session.

04:46You can see that most of them are sporadic, like your hooks and skills and the LSP for navigation. We'll talk about all these as well. But your global rules as your foundation, it is dictating the behavior of clog code the entire time.

04:59So you better spend a good amount of time strategizing around your context curation here. And so their first tip is to keep your global rules lean and layered.

05:09Something that I see a lot of people do unfortunately is create these global rule files that are thousands of lines long. That is not a good idea. There are actually studies out there that prove that that can hurt your coding agent performance even if you think that being really specific and comprehensive helps.

05:24You're just gonna overwhelm your LLM with context. You just need core information. What is the code base about?

05:29Give it a little bit of an idea of the tech stack or architecture, for example. I mean, this is just an example that I have in this repo.

05:36Um, then your general conventions and gotchas like what I have right here, commands to run for things like testing and getting the dev server spun up, like that's all you really need. So keeping it lean. And what Anthropic means by layered is you can actually have claw dot m d files in sub directories.

05:54And so I have the main claw dot m d at the root of my repository here. That means that whenever I start a Claude session like this, it is always going to have these rules loaded. But then as soon as I navigate and start editing files in one of these subdirectories,

06:12it's also gonna load in that claw dot m d automatically. So if I start working in the API service for example, I'm gonna load in my core rules or I should say those are already loaded, but then I'm also going to load in the API service rules that I have in this separate claude.md. So I'm building

06:29up the list of conventions based on where I'm actually operating in the code base. It's like the idea of progressive disclosure that we have with Claude code skills. This is really powerful because if you have a massive code base, you're gonna have a ton of conventions,

06:43but most of them are gonna be specific to certain slices of the code base. So let's just load in the conventions we need depending on where we're working. Because whenever you have some kind of GitHub issue or Jira ticket or whatever,

06:54hopefully it's scoped to a very specific part of your code base. And then another thing you can do if you're really confident where you need to work in a code base is you can actually initialize clog code in that sub directory. So if I know for example that based on a Jira ticket or GitHub issue, I'm only gonna be working in the API service,

07:13then I can, you know, right click in Versus code, copy my path, and then within here, just for the sake of example, I guess I'm already there, but I can change my directory to that path and then I can open up Claude here. And the power of this is now this is the current working directory for Claude code. So unless I tell it to, it's really gonna stick to editing files in just this directory.

07:34So it'll load the claw dot m d here and then it still will load the root claw dot m d. So you can see that it does automatically walk up the directory tree and load every claw dot m d. So the root context isn't lost, but we're just honing Claude code in on that part of the code base.

07:51And so basically, you're doing the navigation here. So the rest of their strategies are like, you know, how can you help Claude navigate things effectively?

07:58But most of the time, especially if you are an engineer, you know where to start. Now, you don't know where to start, that's where this strategy comes in.

08:07Building up some kind of code based map when the directory structure doesn't do the work. And this usually I put in my global rules.

08:15So I don't have it in this example here. But often what I'll do is I'll have a section that outlines the directory structure, like all the subdirectories, maybe like a brief description of each of them.

08:24CTAThat way Claude can help me do the discovery, help me figure out what slice of the larger code base to focus on based on the work that I have. So usually it comes down to Claude's gonna help you figure that out or you're just going to immediately know and initialize Claude code there. The sponsor of today's video is JetBrains Academy.

08:43CTANow, I've tried a lot of AI courses in the past and most of them have this problem. The way that I'd put it is that the course ends where the real work actually begins. And what I mean by that is you'll go through some material and some really basic exercises,

08:57CTAbut then you don't get to really deploy anything. And JetBrains Academy is different with their skill paths. Here, when you learn a concept, you get to apply it to a real project immediately.

09:09CTAAnd so you do your work and go through the lessons in the IDE, and then you get to right away deploy what you've built in AWS sandboxes. So you start by picking a path. Let's say we wanna do build and deploy custom LLMs with Python and AWS.

09:23CTAThis seems very relevant right now. And so we have the course layout here. This is the syllabus, and you can see that each of the sections, it'll open the course in your PyCharm IDE.

09:33CTASo we get to go through the material where we're doing all of our coding already. The AI assistance is built right into the IDE. I can navigate through the lessons and go through all the material right here really easily.

09:43CTAAnd then as I'm doing my exercises, it's just right here in the IDE. So I get to code as I normally do. And then when it comes time to run and deploy things like the fine tune model that we have in this lesson, we get to do it in an AWS sandbox.

09:57CTAThis is not a mock. It is running in the cloud, but it's fully prepaid. You don't need an AWS account.

10:03CTAThis is what I wish I had when I was learning how to build and fine tune models. So when you finish a skill path, you have real projects deployed live that you can host on GitHub, talk about interviews, and get hired for. And you have certificates both from JetBrains and AWS to back it up.

10:19CTASo if you're looking to build proficiency and credibility with generative AI and LLM engineering, I would highly recommend checking out JetBrain Academy's skill pads. I'll have a link to them in the description.

10:29CTACool. So there are some more strategies to cover here like scoping your tests and link commands per subdirectory, ignoring certain files like build artifacts, your coding agent never reads them.

10:39But I wanna move on now to talk about the next part of the AI layer and that is hooks. And you'll see in a second why I wanna cover this right after global rules. So you can use hooks to make your entire AI layer, your entire setup self improving.

10:55This is really really cool. This is part of the goal that I was talking about. So most teams think of hooks as scripts that prevent Claude from doing something wrong.

11:03So a lot of people use hooks like a pre tool use hook to stop Claude from editing in certain directories, removing files or folders, that kind of thing. But their more valuable use is continuous improvement. And so take a look at this.

11:16A stop hook can reflect on what happened during a session and propose claw dot md updates while the context is fresh. Right?

11:25So the the hook runs at the end of the session. And I have a live demo of this. I'll show you.

11:30I actually built out both of these hooks here. And then a start hook can load team specific context dynamically. So every dev gets the right setup without manual configuration.

11:40So based on the role or the part of the code base they're editing, we can have a hook that will even go out to confluence for example and pull documentation for that team, that function, that part of the code base, whatever. So I have actually a pretty basic example of that here.

11:55And so I have a hook and so you can see in my settings dot JSON, this is where I have the start and end hooks defined. So propose Claude dot MD updates for the stop hook and then the session start context for the start hook. And so what this hook does, this is just more of a basic example,

12:13is it's going to load context around git. And so any kind of unstaged changes that I have, like a change to this file here, looking at the git history as well. So take a look at this.

12:23If I go into Claude and I start a new session and then I say, what did the start session hook tell you about this session? Obviously, it's a little cheesy, but just to show you that it loaded the context, we have this orientation here. The working tree is clean.

12:36Here are our recent commits. Right? And so like this is just giving it some context going into like here is what we're currently working on and here is what we worked on recently.

12:45And you could extend this, like I said, to pull things from confluence based on the developer that is starting Cloud Code. There's a lot that you can do here. And then take a look at this.

12:54To demo the stop hook for you, I'm going to give a really simple request for something that I wanna change. Obviously, if I was doing work for real, I go through a more extensive process of planning and implementing and validating. But here I'm just asking it to make a simple change so that I have something to then propose a change to the global rules.

13:13Because something really important that you need to do, and you can see that the process actually ran here in order to propose some changes. Uh, something that's really important to do is as you're evolving your code base, you need to make sure that your rules are evolving as well. It's really, really bad when your claw.md

13:29goes stale because you made some changes in the code base where the it kinda, you know, dictates something needs to be added to the global rules or something has to be updated. And so that's why it's really powerful to have this kind of process that automatically proposes these changes.

13:45So take a look at this. This hook runs whenever Claude

13:49stops. So whenever it's done with its turn. So you saw that terminal pop up for a little bit.

13:54It runs a separate Claude session in headless mode to look at these changes, look at the global rules, and propose if anything needs to be tweaked. And so it outputs that in a markdown document. Take a look at this.

14:05I have my Claude markdown review. And so we have the reflection that just ran right now.

14:11Are the two areas that were touched. So it's gonna look at those subdirectory global rules as well. And here it decided no changes needed.

14:18Adding a trial enum value follows the existing model only convention. So the thing that we really care about at a high level still holds up based on these changes. And so maybe it's not the best example because it didn't decide to change anything, but I think it's also really powerful because usually we don't need to change our Claude code or claude.md conventions.

14:36That's especially why we keep these files so lean. But maybe for example, I could say, you know, make a change that would require updating the claud.md.

14:45So I'll come back and see what it does with this. And so there we go. We had to change something bigger in the billing service and now in our markdown review, it is recommending making an update to the second bullet in the claude.md

14:58for our billing service subdirectory. Pretty neat. So now we can take these recommendations

15:03and we can action on it ourselves. We can have a conversation with a separate Claude session to make these changes. It's up to you how you want to take this forward.

15:11The power I'm just trying to show you here is we can have this self reflection process constantly running in the background, making these suggestions that we can, uh, you know, just action on when we're actually ready to.

15:22So the next part of the AI layer that Anthropic focuses on here is skills. And you probably know what a skill is. They've been blowing up all over the Internet the past few months.

15:31It's really like the main way to extend the Claude code right now with new workflows and capabilities. And so, like, this is an example of a skill right here for adding API routes in this code base. Really a skill is some kind of set of steps, some kind of process reusable prompt that you have for Claude code.

15:50And these are really important in large code bases because you're gonna have dozens or maybe even hundreds of task types. Like this would apply to a task type of building an API endpoint. And so not all expertise needs to be present in every session, which is the same reason why we have different clon dot m d files in subdirectories, which there is definitely some overlap here that I'll talk about in a second.

16:12And so skill solve this through progressive disclosure. So we're offloading specialized workflows and domain knowledge and we load it when we actually need it.

16:21So that way we're not bringing in prompting and workflows for things that don't apply to the current task at hand. And so when we define a skill, we have the name and the description. The description is what is given to the coding agent right away.

16:35And if it decides like, okay, based on the description, I should use the skill, then it'll read the full skill dot m d file. I've talked about skills a lot on my channel already. But the parameter that most people don't know about, and this is what Anthropic talks about right here, we can make it so that skills can be scoped to specific paths so they only activate in relevant parts of the code base.

16:58Like we know that this process for adding API routes that we want to be very repeatable, it only applies when we're gonna be reading and editing files in the API services directory. And so we can scope it there. Really, really powerful.

17:11It's a way to basically enforce it like when we touch this part of the code base, we're going to bring this convention, this workflow into this session context. And so like I said, there is a little bit of overlap here with that and the subdirectory claw dot m d files. Right?

17:26Like we're loading this in only when we work here. Same thing when we operate in here. We're also going to read this claw dot m d.

17:34The distinction that I'd like to make is that global rules are your conventions. It's the rules that you need to follow. Like, every route is registered here, for example.

17:43Your skills are the workflows. So we have rules and we have workflows.

17:48So that distinguishment kinda helps me understand the overlap, but really for a lot of these sorts of conventions, you can kinda do it as a skill or a clon dot m d.

17:57The more important thing here is we just wanna scope these conventions and rules to the part of the code base where they actually matter so we're not overwhelming our coding agent with context it doesn't care about. So Anthropic talks about plug ins next, but I'm gonna cover that at the end because I'll show you how to use my plug in to incorporate all these ideas in your own code base.

18:16So let's move on to talk about language server protocols. I'm excited for this because I just started incorporating this into my own Claude code ecosystem. It's really powerful.

18:27Essentially, you give Claude the same navigation that a developer has in their IDE. And And a lot of bigger companies especially build own custom LSPs to really help Claude navigate through their code base effectively.

18:39And so an LSP is something that is really built into any IDE by default. It's the kind of thing that allows you to know like in Versus Code, I can control click here to immediately navigate to the definition for the class that I used in this other file.

18:53So that kind of like type hinting and navigation and highlighting like all that is in LSP. And so essentially with an MCP server,

19:03we can give Cloud Code this exact functionality that we as the engineers have in our IDE to make it so that we have better search capabilities than just GREP by itself or can complement some of the tools like grep that are built into cloud code natively just through the CLI commands that it has. And so what I built here, I'm actually kind of knocking two birds with one stone because they talk about LSP and then talk about MCP servers as a way to extend everything

19:29is I built a local MCP server that comes with this code base. It comes with the plug in that I'll cover in a little bit as well that gives Claude code some new code base search capabilities. And so take a look at this.

19:42I'm gonna go into a new Claude session here. If I do slash m c p, you can see that I have the code based search enabled. There are three tools here to complement the search capabilities that I already have.

19:54And so I'm going to paste in a prompt, find every place that monthly total sense is referenced in this repo. And I know that's like oddly specific, but that's the point is we need something very specific to search for here. And I'm telling it not to use grep just for the sake of the demo.

20:08I'm telling it to use a symbol level approach and that's going to key in that it needs to leverage the MCP server that I built here that leverages the language server protocol. So it's able to do more intelligent searches

20:20that it might figure out it needs to if I don't tell it to not use grep or you could just, you know, build in some instructions in your global rules for how you want to use these searches. We can see here that it used my whereas and find references tools in my custom MCP server.

20:34And so here are the results. We have one definition and two references. Pretty cool.

20:39So I I know that like I'm talking about complex code bases here but my demo is kinda simple in the end but I kinda have to have that balance there of like a somewhat complicated code base but still it has to be easy to like parse through and show the results here. But that's an example of using an MCP server to expose a language server protocol.

20:59And really for massive code bases, once you get like into the six digits for lines of code, you need something like this because Grep by itself is gonna be slow and really token inefficient as you're trying to navigate through a code base. This is a lot more of a directed search looking for things like the definitions and references

21:19for things like classes and variables. So that's a quick overview of LSP and MCP and how I use them together. You gotta have some kind of harness to give better search capability to Claude code when you're working in larger code bases.

21:33And really, they operate like skills, just use sporadically throughout your session. So like with skills, we're loading in instructions when we need those conventions or workflows, whatever. LSP, whenever we need to perform those searches to find definitions, references, things like that, we'll call upon the tools.

21:48MCP, pretty similar. Right?

21:50Like we need to perform a search, take some kind of external action. We call upon one or multiple tools for an MCP server at that time. Now the last part of the AI layer that we still have to cover is sub agents.

22:01But this one's nice and simple. The the advice that Anthropic has here is simple but still really powerful. We want to use sub agents to split exploration

22:10from editing. So the idea with a sub agent is that we send in a task, like we want to search the web for, you know, best practices for this kind of architecture or maybe to do some kind of code based exploration to find the part of the code base to focus on.

22:26We send in a task and it runs with its own context window. It does all the analysis it needs to and then it returns a summary back to our primary cloud code session to reason about an action on. And these kind of exploration

22:42tasks that we wanna give to a sub agent, you can imagine them getting to hundreds of thousands of tokens. So if we're not using a sub agent and we have our primary Cloud Code session do that web research or code based exploration, by the time we get to the actual editing, we're already gonna have this extremely bloated context window.

22:59That's why we want to dispatch the work to sub agents, especially because with exploration, usually all we need is that summary back. Right?

23:06Like, here are the recommendations for the tech stack. Here's a part of the code base we're gonna have to address based on this Jira ticket. Like, that's the kind of thing that you task a sub agent with.

23:15And so I don't actually have that much of a demo here for sub agents because I use them liberally, like, all the time. Especially at start of the conversation, I'll say something like, I want you to spin up three sub agents here. One to research the database, one the back end, one the front end.

23:29Help me figure out how I can add in authentication. I don't know. I'm just kinda throwing off something off the cuff here, but you have sub agents built into a lot of these coding agents now like Cloud Code and Codex.

23:39And so you don't even have to define your own custom sub agents like a lot of people did before. You just send off a request like this and now it's just gonna use the explorer sub agent that we have built into Cloud Code. And so it takes care of that whole, dispatch, getting the summary back, and everything.

23:54Alright. So the rest of the article that we haven't covered yet is really covering a lot of the strategies that I already hit on, like running LSP servers so Claude can search by symbol not by a string, talking about actively maintaining the Claude dot MD files. So this is where the stop hook comes in to make those recommendations as we're operating with Claude code.

24:13The other thing that I wanna hit on here is the plugin that I have for you. So if you go to the read me for this demo repo that I'll link to in the description, I have instructions for taking this to your own code base.

24:26Now obviously some of the things like the claw.md's and the subdirectories are specific to me, But this plugin is going to give you the self improving stop hook, the explorer sub agents, there's more of like a custom sub agent that I built that you can use.

24:38And it's gonna give you the code based search MCP server with that LSP. So you have that whole searching harness. And then I'm going to give you a more generic skill that you can use in a as an example that shows what it looks like to use that path parameter to scope a skill to a certain subdirectory.

24:55So just kinda consider this plugin a starting point. If you wanna like really quickly pull in these things to experiment on your own code base, you can install this plug in on any code base, even one that you already have built out with its own AI layer already. So all you have to do is slash in cloud code slash plug in marketplace ad and then give the path to the repository.

25:15So you still have to clone this repo locally because I don't have this hosted in NPM. So you give the path and then make sure you add the tooling folder at the end. And then you just do plug in install helpline AI layer at helpline tooling.

25:27Then you go through the whole installation process here and then boom, it'll install all these things for you to start playing around with. So that's one way to do it. I just wanted to add a plugin to make it really convenient.

25:37The other way to get started with a lot of the ideas that I have here aside from, you know, reading the anthropic blog post is just to clone this repository, point your Claude code at it, you know, like copy this directory, you know, right click copy path, give this to Claude code and say, hey, these are a bunch of cool strategies Cole shared with me for working with complex code bases.

25:56Help me understand how they work and how I can incorporate it for my code base. That's always the easiest way to really take any repo these days is just give it to Claude code and have it, uh, help you understand it and apply it. So that'd be my recommendation for getting started here.

26:11CTASo I hope that you found these strategies really useful. You can apply them right away to your larger code bases. Even if you had some of these things incorporated already, I hope there were some good golden nuggets for you.

26:22CTAAnd so the last thing that I wanna end on is talking about some really good advice that Anthropic also gives at the bottom of their article here. It's all about assigning ownership for clon code management and adoption.

26:34CTAAnd I've been around a lot of companies as I've done my consulting and trainings. I know this is really good advice. Essentially, what they're saying here is you have a an individual or more likely a smaller team to champion the initial build out of the AI layer for your organization.

26:50CTAAnd so what that looks like is you start with a quiet investment period. You have a couple of people that build out the the rules and skills and the LSP and the MCP servers, the whole AI layer for the organization, and then roll it out to people over time.

27:05CTAAnd the power of this is you get to create something that's really foundational for everyone to adopt together and then people can get more consistent results with Claude code or whatever coding agent faster so they're not disappointed when they first use the tool. You wanna avoid people being really disappointed when they first use it because they don't have an AI layer and you want to avoid everyone evolving their own separate AI layers when really you want a standard for the organization.

27:30CTASo really, really good advice that they have here. This is also something that I help with. And so I do offer enterprise trainings where I help you build up the AI layer, understand the core methodologies for AI coding and create that standard

27:44CTAfor your adoption of coding agent tools like Cloud Code. So definitely, got my email in my bio. Reach out to me if you're interested in that.

27:51CTAOtherwise, I hope that these strategies were useful for you. I appreciate you going through everything here. Let me know if you have any questions in the comments below.

27:59CTAOtherwise, if you appreciate this video and you're looking forward to more things on AI coding and Claude code, I would really appreciate a like and a subscribe. And with that, I will see you in the next video.

— full transcript

§ 05 · For Joe

Steal the format: translate a high-level post into a working repo.

AI-Layer playbook

When an authoritative source (Anthropic, OpenAI, Cursor) drops a high-level post, the gap between 'good idea' and 'in my repo today' is a video.

Pick an authoritative post that's high-level on purpose — Anthropic engineering posts, OpenAI dev posts, model-vendor playbooks. The gap is the whole opportunity.
Build the missing concrete examples in a real demo repo BEFORE you write the script. Cole's whole video rides on the fact that the helpline repo already exists and works.
Rename the abstract concept into your own framework word (Anthropic 'harness' → Cole 'AI Layer'). You now own a phrase your audience will quote back to you.
Map each abstract bullet 1:1 to a concrete demo. Show the file, run the command, show the output. The transcript should be unusable without the screen recording.
Bundle the demos into something one-command installable (here: a plugin). The plugin IS the lead magnet — no email gate, just download.
Drop one clean reframe per major section. 'Rules vs workflows.' 'Sporadic vs always-on.' These are the lines that get screenshot.
End with the source's organizational/strategic advice and pitch enterprise training off the back of it. Soft CTA, low friction, the value already delivered.

§ 05 · For You

What this means if you're building with Claude Code.

If you're setting up your own repo

Most of what makes Claude Code feel 'smart' on a big project isn't the model — it's the seven pieces of context and tooling around it.

Keep your root CLAUDE.md ruthlessly short — a few hundred lines max. Put folder-specific rules inside subdirectory CLAUDE.md files; Claude walks the tree and loads them automatically.
Try a Stop hook that runs a second headless Claude session to review what just changed and propose CLAUDE.md updates. The model itself becomes your doc-maintenance loop.
When a skill (or rule) only applies to part of your codebase, scope it with the `paths:` parameter. Don't pay context tax on irrelevant workflows.
If your repo is past ~100K lines, you'll outgrow grep. Wrap a language server in an MCP so Claude can search by symbol (definition / references), not by string.
Default to subagents for any exploration step (web research, codebase mapping) — keep the main session's context window clean for the actual editing.
Clone coleam00/helpline as a working reference. Don't try to build everything from scratch — start from a repo that already has the seven pieces wired up.

§ 06 · Frame Gallery

Visual moments.

01:56

11:53

21:32

25:52