WEBVTT

00:00:00.000 --> 00:00:01.120
In 2025,

00:00:01.120 --> 00:00:15.015
everyone thought that we were gonna create these new browsers and inject AI inside of them, but we're starting to realize that this is not what's happening at all. In this video, I wanna talk about how you can prepare for the future

00:00:15.095 --> 00:00:16.855
of using AI agents

00:00:17.015 --> 00:00:20.615
in these brand new super apps like Claude Desktop

00:00:20.775 --> 00:00:35.490
and Codex. I'm about to play some clips from this podcast. This is Lenny's Podcast, and he invited a guy named Dan Shipper on his podcast. And I believe he has an incredible intuition into where AI agents are going. Basically,

00:00:35.570 --> 00:00:47.205
what happened was Anthropic realized at some point that with Claude Code, once you have a coding agent on your computer that can build anything, it's actually really good for any kind of work you wanna do. And people started just

00:00:47.685 --> 00:01:00.645
hacking Claude Code essentially to do all their work. So Anthropic then built Cowork, which is, you know, a little bit of a nicer wrapping around Claude Code, but it's fundamentally the same thing. You know? I think OpenAI made a couple of different bets. They

00:01:01.260 --> 00:01:05.660
launched the Codex desktop app, and I think the Codex desktop app takes

00:01:05.980 --> 00:01:12.060
if you look at all the lessons that, like, Anthropic learned, they went from Claude Code to Cowork,

00:01:12.220 --> 00:01:16.940
and you can kinda see that in the tabs on the Anthropic desktop app UI.

00:01:17.340 --> 00:01:38.700
I think OpenAI was just like, we see where this is going. Like, let's just skip to that. And I think Codex right now, it's my daily driver. I, like, spend all my time in it, basically. I flip to Claude every once in a while, but I think they're getting the paradigm right. And it's clear to me that whoever is in the lead, because, again, I think it'll change. Whoever's in the lead, it feels very obvious to me that

00:01:39.180 --> 00:02:14.290
all of the work that you do is going to be in one of those surfaces. So that's what Dan believes. Dan believes that most work that people do will be done in one of these two apps. And so these super apps are really good at doing any coding task or any knowledge work task, like creating documents, spreadsheets, or basically anything that you could theoretically do on your computer. And what Dan explains next, I really wanna make sure that you have the right context to understand exactly what he's saying. So he's about to talk about a tool that he uses called Proof. And so his company, Every, created a product called Proof.

00:02:14.675 --> 00:02:18.835
And what it is is a document editor. So this is a document

00:02:19.155 --> 00:02:19.795
editor,

00:02:20.035 --> 00:02:21.875
and I want you to just picture

00:02:22.035 --> 00:02:23.395
like Obsidian

00:02:23.395 --> 00:02:28.115
or Google Docs. Right? He created a document editor except

00:02:28.390 --> 00:02:31.110
it's made to be agent native,

00:02:31.510 --> 00:02:36.310
and this agent native document editor is made to be used by humans

00:02:36.390 --> 00:02:38.790
and by agents at the same time.

00:02:39.190 --> 00:02:40.230
And because

00:02:40.550 --> 00:02:42.135
both Codex

00:02:42.135 --> 00:02:45.975
and Claude Code are developing these browsers,

00:02:46.295 --> 00:02:55.175
this browser over here that sits right next to my agent, Claude Code also has a very similar browser that sits right next to your agent chat.

00:02:55.600 --> 00:03:07.200
Because these super apps all will come with a full browser. Right? Imagine this is Google Chrome that you can use alongside your agent. It'll completely change the way people use computers.

00:03:07.725 --> 00:03:11.965
Okay. Let's listen to the rest of this clip. Uh, for example, when I'm writing a document,

00:03:12.285 --> 00:03:13.805
Codex has a browser

00:03:14.205 --> 00:03:17.325
in, uh, in the app. It has an in app browser.

00:03:17.645 --> 00:03:23.340
And when I'm writing a document, I just go into one of my Codex threads, which I have one thread for every project.

00:03:23.820 --> 00:03:33.180
And I just open the in-app browser. I go to the document. I usually do it in Proof, which is this online markdown editor I built. And then I just have Codex

00:03:33.180 --> 00:03:44.915
running and watching me in Proof. And Codex can see what I'm doing. I can see what Codex is doing. It's all kind of in one place, which is an extension of the same thing that made Claude Code work really well originally.

00:03:45.395 --> 00:03:48.515
And I basically feel like I have this parallel

00:03:48.995 --> 00:03:58.970
work buddy that not only can it respond and write in the document, but then it can go do research. It can use my computer to basically do anything that I can do on my computer, and that's, like,

00:03:59.530 --> 00:04:01.050
incredibly powerful.

00:04:01.050 --> 00:04:07.955
So just to illustrate, right now on Codex, I have this browser open, and I can type directly into Google Docs.

00:04:08.115 --> 00:04:27.670
And I can also say, hey, Codex, can you please, uh, look at your memory and please just tell me, like, all my favorite tools that I use and just put it in this doc, please? And because Codex has full context over my browser, it can view the browser, it can see exactly what I'm working on, and it has browser use. I could also say browser.

00:04:28.070 --> 00:04:31.670
Uh, please, when you're done, use browser use to, like, make sure it's the right formatting.

00:04:32.775 --> 00:04:36.455
Right? I can also allow Codex to control this browser

00:04:36.615 --> 00:04:50.590
and make sure it's the proper formatting. These AI agent tools have full control over our computer and full memory and context over my integrations and everything like that. And look at that. It just wrote this down.

00:04:50.910 --> 00:04:53.550
So now it's verifying Google Doc formatting.

00:04:54.590 --> 00:04:55.390
And look,

00:04:55.550 --> 00:05:01.950
now it's fully controlling a browser. That little mouse is not me. That is browser use within Codex.

00:05:01.950 --> 00:05:04.815
Hey, guys. This is me from the future. Codex,

00:05:04.895 --> 00:05:08.735
right before I was about to upload this video, Codex just got an update

00:05:08.895 --> 00:05:11.695
to their in app browser. I'm not kidding.

00:05:11.855 --> 00:05:14.575
As of one hour ago, when you use Codex,

00:05:14.655 --> 00:05:22.300
you stay logged into your browser. So here I'm fully signed into the browser on Twitter and watch. I'm gonna go to a new tab.

00:05:23.580 --> 00:05:25.100
I'm gonna open the browser

00:05:25.260 --> 00:05:28.780
and now I can sign into anything. So I can go to twitter.com

00:05:28.780 --> 00:05:32.940
and I can open it up and you can see here I'm still signed into my previous account.

00:05:33.875 --> 00:05:37.315
Here I can go to docs.google.com

00:05:37.475 --> 00:05:53.280
and check this out. I'm already signed into my Google Docs account. And here I have Codex open. This is my Notion. I'm signed into my Notion. If I were to create a new chat by pressing command n and I were to go open up my browser, I could click browser and I could type notion.com

00:05:53.280 --> 00:05:56.560
and I should. And I think all I have to do is click log in

00:05:57.120 --> 00:06:05.105
and yes, I'm already logged in. So it's logged into my Notion. We are approaching a full web browser directly inside Codex.

00:06:05.185 --> 00:06:07.665
So the reason this is new

00:06:07.905 --> 00:06:10.625
is we are shifting from a world of

00:06:11.025 --> 00:06:11.905
browser

00:06:11.905 --> 00:06:14.145
tabs to task tabs.

00:06:14.305 --> 00:06:17.000
If we go to an application like Comet,

00:06:17.240 --> 00:06:19.640
which was an AI browser,

00:06:19.800 --> 00:06:21.640
and they added this assistant

00:06:21.720 --> 00:06:29.400
side panel. And this assistant side panel can actually control the browser, and they basically created these new AI browsers.

00:06:29.495 --> 00:06:35.735
And there's some other examples as well. There was Dia browser and then Atlas, which was OpenAI's browser,

00:06:35.975 --> 00:06:40.375
and even Google Chrome itself is starting to get some AI features.

00:06:40.455 --> 00:06:48.080
But this new future with your AI agent platforms or your super apps, instead of it being a bunch of browser tabs,

00:06:48.240 --> 00:06:51.520
it's a bunch of task tabs on the left sidebar.

00:06:51.600 --> 00:06:58.175
And when you click on a task tab, it opens up an agent thread and then you see your browser.

00:06:58.495 --> 00:07:04.175
Right? It opens your browser. Your agent will open your browser, or you can open your browser manually,

00:07:04.255 --> 00:07:06.975
but you have all of the different browser

00:07:06.975 --> 00:07:07.615
tabs

00:07:08.030 --> 00:07:10.030
for your agent thread.

00:07:10.270 --> 00:07:13.550
That's the paradigm that we're shifting to for productive work.

00:07:13.710 --> 00:07:35.175
Instead of having all these random tabs open, you just have your task, your agent, and then the browser tabs that are relevant to that task. Right? We're going from this right here. Right? And you get lost. I know you've used web browsers and you end up getting lost and you're like, okay. What was I even doing? And we're switching to something that looks like this. We have different tasks.

00:07:35.810 --> 00:07:37.730
Each task has different,

00:07:37.890 --> 00:07:39.330
uh, side panels.

00:07:39.650 --> 00:07:41.810
Right? Here's my personal landing page.

00:07:42.050 --> 00:07:43.250
So the browser

00:07:43.250 --> 00:07:48.930
that's open right here, this happens to be running a website. Right? This is deployed on the Internet

00:07:49.585 --> 00:07:53.265
and this browser tab is relevant to this task.

00:07:53.425 --> 00:07:56.385
So I switch from this task to this task

00:07:56.545 --> 00:08:12.840
and here we have the Google Doc open. And so I can interact with my agent and my agent can control this document and I can control this document. This app right here is an agent native app, and that is an app that you control with your agent.

00:08:13.080 --> 00:08:14.520
You can directly

00:08:14.520 --> 00:08:22.815
edit it. Your agent can see your changes. Your agent can edit it. You can see your agent's changes. So it's becoming this collaborative

00:08:22.815 --> 00:08:26.415
process with AI agents. As AI agents get smarter,

00:08:26.655 --> 00:08:43.380
the AI agent will, in theory, be able to do more and more of this. And the fact that it can control your computer and your browser means that maybe over time, it'll actually predict what you wanna do and it will just do it for you. And so what does this mean for SaaS or software in general?

00:08:43.780 --> 00:08:45.940
And the reason why Dan,

00:08:46.180 --> 00:08:48.660
uh, the reason he built the thing called Proof

00:08:49.005 --> 00:08:55.565
is he believed that using Google Docs is not an ideal experience. You have to sign in, authentication's

00:08:55.565 --> 00:08:56.605
really annoying,

00:08:56.765 --> 00:09:04.600
uh, the interface is not ideal for using AI agents. Instead, he built his own document editor, which is much more lightweight,

00:09:05.000 --> 00:09:08.760
but it's much better for AI agents and human collaboration.

00:09:09.080 --> 00:09:14.760
And so what his thesis is is that all of the best SaaS companies will kind of become

00:09:15.335 --> 00:09:20.055
these agent native apps, apps that you can use with your AI agent.

00:09:20.375 --> 00:09:34.210
Instead of building a SaaS app, right, on Google Docs, you can click this Gemini button and you can have AI edit this. The problem with that is this AI that's connected to Google Docs doesn't have the same context

00:09:34.290 --> 00:09:45.035
as Codex does. Codex is connected to all of my apps. Codex knows everything about me. And so the future of SaaS might not be building an app that comes with an AI agent inside.

00:09:45.355 --> 00:09:50.475
The future of SaaS might just be to create an app that your existing agent

00:09:50.635 --> 00:09:59.050
that more and more people will use will have access to. Right? Make an app that your users can use with their own agents.

00:09:59.290 --> 00:10:14.165
They can bring their own agents. They can use their own tokens rather than providing an agent and providing tokens like Lovable or Replit or a tool like Gamma. And so to conclude this video, I wanna do some predictions and some suggestions.

00:10:14.325 --> 00:10:19.440
So my predictions for the future are as follows. So let's take a look at the first prediction.

00:10:19.840 --> 00:10:22.560
I think that Claude Code and

00:10:22.720 --> 00:10:24.800
I will say Codex will

00:10:25.200 --> 00:10:42.255
add multiple browser tabs per session or per task. If we were to go to Codex, you'll see that we have a browser open right here. I can create a new tab. I can open this up, and I can open up a browser right here, and I can go to espn.com,

00:10:42.255 --> 00:10:42.655
for example.

00:10:43.460 --> 00:10:51.940
I can't click here and open up a new browser. I believe that you'll be able to open as many browsers as you want per chat thread.

00:10:52.180 --> 00:10:58.445
And to kind of piggyback on this prediction, I believe that the agent will start opening one or multiple

00:10:58.685 --> 00:11:03.085
browser tabs when you create a new chat. Let's say I wanna create

00:11:03.325 --> 00:11:04.925
some Twitter posts

00:11:05.165 --> 00:11:23.080
and I use a tool like Typefully, which is where I draft all of my tweets. It may open my Notion and it may also open Typefully automatically whenever I suggest that I want to write some tweets. Why would it do that? Codex might do that automatically because every time I go to write tweets, I open up Notion

00:11:23.080 --> 00:11:24.760
and I also open up Typefully.

00:11:25.215 --> 00:11:34.335
The agent that you're interacting with in the super app will start learning based on your activity in the browser. It'll start doing things automatically.

00:11:34.495 --> 00:11:37.775
And maybe the future of, like, automating your workflows

00:11:37.855 --> 00:11:39.270
is less about

00:11:39.590 --> 00:11:40.470
constructing

00:11:40.470 --> 00:11:53.965
them and more about just doing high quality tasks over and over again. And then over time, your own AI agent will just learn how to do tasks for you. Wait. I forgot. That was my second prediction.

00:11:54.045 --> 00:12:02.445
Right? The agent will start automatically opening browser tabs when you start tasks. I genuinely believe that's the direction it's going. And then finally,

00:12:02.845 --> 00:12:09.790
I think the agent native apps are going to evolve into a new type of app and these are gonna be generative UIs

00:12:09.790 --> 00:12:11.070
that integrate

00:12:11.310 --> 00:12:25.155
with your own tools. Let me explain what I mean here. On Codex, you can create plugins. And for instance, I have a Gmail plugin. And what I can do is I can say, read my last

00:12:25.155 --> 00:12:30.035
20 emails, figure out the three that I need to respond to, and generate a draft for all three.

00:12:30.835 --> 00:12:34.230
What I believe the future of AI

00:12:34.230 --> 00:12:37.510
in these super apps will be it'll generate

00:12:37.750 --> 00:12:45.270
a UI or a mini app that shows the input message and the response. Right? And your agent will automatically

00:12:45.270 --> 00:13:06.510
create these little generative UIs for you to refine it. Here's a drafted response to this person. Let me know if you like it. And you'll be able to open it up in your browser. And once you check your agent's work, you'll be able to send it directly in the app that your agent generates for you. You'll be able to create little mini apps that integrate with all of your tools,

00:13:06.750 --> 00:13:17.105
and you'll actually be able to use all of your integrations. And so that's how we're gonna kind of transition into this Jarvis future where you tell your computer to do something, boom, generative UI,

00:13:17.265 --> 00:13:33.710
you might make some refinements like Tony Stark, and then you're like, send. Send it off, and that is going to be the future. And I don't think we're that far off from that. I think we're three to six months from these new generative mini apps, which is what I call them. And so my suggestions to you are as follows.

00:13:33.950 --> 00:13:37.390
I want you to start thinking about your entire job at the task

00:13:37.470 --> 00:13:47.645
and SOP level. Take all of the things that you do and organize them in a list of tasks. What are all the things that you do on the way to completing something?

00:13:48.045 --> 00:13:58.125
And if you were to create these little documents, you'll be able to organize things at the task level. And this will allow you to be organized once Codex and Claude Code add these

00:13:58.370 --> 00:14:13.075
full browser features, which I guarantee you they will. And you'll be equipped to, like, kind of divide all of your work into these tasks. You'll be able to get way more done. If you are a builder or if you are an indie hacker or you wanna make money selling applications,

00:14:13.155 --> 00:14:14.515
I highly recommend

00:14:14.675 --> 00:14:24.915
you take a risk and try to go from the world of these static applications that you might submit to an app store to these mini apps or these agent

00:14:24.915 --> 00:14:25.715
native

00:14:25.800 --> 00:14:28.520
apps that you and agents can use.

00:14:28.840 --> 00:14:35.960
And I guarantee you once you go down this rabbit hole you will find so many interesting ideas. You can create an app

00:14:36.200 --> 00:14:38.120
that is basically a skill

00:14:38.325 --> 00:14:40.485
that other people can install

00:14:40.485 --> 00:14:52.325
onto Codex that makes a certain task easier to do. And then finally, I just think you should learn Codex and Claude Code. I'm telling you Codex and Claude Code are the future of all knowledge work. Coding tasks,

00:14:52.930 --> 00:14:54.370
general knowledge tasks,

00:14:54.690 --> 00:15:14.565
I think most things are just gonna be done inside Codex and Claude Code. I 100% agree with Dan Shipper from Lenny's Podcast. I think he's spot on, and I think learning how to use Codex and Claude Code is basically the most important thing that you can do in the world of business right now. Hey, guys. Thanks for watching this video. By the way, this is Claude Code

00:15:14.565 --> 00:15:23.510
or Codex running in my text messages. This is a new product that my team and I have been working on. This is my James Bond agent that runs on my phone or my computer.

00:15:23.830 --> 00:15:26.790
Hello, buddy. This is Claude

00:15:26.790 --> 00:15:28.150
Opus 4.7

00:15:28.150 --> 00:16:00.170
responding to me. I can change it to any other model. This is running twenty four seven on a computer. Right? And it has files. So it's basically if you're running OpenClaw in the cloud twenty four seven except you can use any model you want. You have access to all of the files. You can create skills or you can browse our built in skills. There's so many of them. You can very easily connect it to iMessage. This agent is built to be iMessage first and allow you to just get done anything you would want to get done that you would do on Claude Code or Codex.

00:16:00.170 --> 00:16:13.925
And you have all of your connections, and you can create automations as well. And this agent runs twenty four seven. Your laptop does not need to be open. I hope you guys enjoy this product. We've been working on it for a while. Anyway, I'll see you guys here for the next video. Peace.
