Inside Anthropic’s Bet on Claude Agents that Work While You Sleep | Jess Yan
A practical look at how Anthropic PMs build and use Claude agents to understand code, monitor feedback and analytics, prep for meetings, and get work done overnight.
Dear subscribers,
Today, I want to share a new episode with Jess Yan.
As product lead at Anthropic, Jess has a first-hand look at how Anthropic teams are moving from prompting AI to building long-running agents that work overnight. In our episode, Jess showed me how to build a Claude analytics agent from scratch, then covered how Anthropic PMs use agents internally to understand the codebase, monitor customer feedback, and make better product decisions.
Watch now on YouTube, Apple, and Spotify.
Jess and I talked about:
(00:00) AI agents that work while you sleep
(03:46) Claude Managed Agents help for long-running tasks
(05:22) Demo: Building a Claude analyst agent from scratch
(09:28) Traces: How to see where an agent got stuck
(15:11) Evals: How agents grade their own work
(20:20) 5 ways in which Anthropic PMs use agents internally
(28:06) Processing a 4,000-company waitlist with agents
(32:50) To build your first agent, focus on helping one person
(36:54) What changes when agents can work overnight
Evan helps you be the most prepared person in a meeting
Evan is built to prepare you for high-stakes meetings. It pulls context from your apps, does CIA-level research on every attendee, and hands you a presidential-style briefing before you walk in. Built for founders, sellers, VCs, and anyone whose meetings are consequential, because showing up unprepared isn’t just awkward, it’s costly.
Top 10 takeaways I learned from this episode
Jess and I covered two topics in the interview:
How Anthropic PMs use agents internally
How to build long-running Claude agents
How Anthropic PMs use agents internally
Anthropic PMs use agents internally for at least five PM workflows:
Understanding the codebase. Instead of asking engineers what shipped or how something works, Jess can inspect pull requests, deployments, and product behavior directly. “Access to our codebase has been the biggest unlock for me.”
Synthesizing user feedback from multiple channels. Jess has agents monitor customer Slack channels to get more signal before deciding what to build next.




