Harrison Chase: Ambient Agents (2025–26) — the UX after chat
Read the field note below to see how we apply this pattern in practice.
Harrison Chase on Production Agents #5: Ambient Agents
Final entry in the Chase Playbook — from an 800-line weekend project to the UX that replaces chat.
The concept
"Ambient agents are AI systems that operate continuously in the background, responding to events rather than direct human prompts. By building agents that listen for and respond to background event streams, founders can enable massively parallel and autonomous workflows, scaling impact far beyond what is possible with chat interfaces."
Critical nuance: "Ambient does not mean fully autonomous, and it's still really important that we are able to interact with these ambient agents."
— Harrison Chase on Sequoia's Training Data podcast, 2025–26
Chase introduces the "Agent Inbox" as the primary UX: a feed where humans review, approve, reject, or modify what ambient agents have proposed to do.
What we heard
The 2023–25 version of "AI product" was almost always a chatbox. Ambient agents are the first serious argument that the next version won't be.
Three shifts are happening at once:
- Trigger shift: from "user types a prompt" to "event fires in a stream."
- Concurrency shift: from "one conversation at a time" to "many parallel agent threads running in the background."
- Interaction shift: from "chat window" to "inbox/approval feed."
The inbox shift is the most important one for product teams to grok. Users don't scale to N chat sessions. They do scale to an inbox — they already use one for email. A well-designed agent inbox means a single person can supervise 20 agents the way a manager supervises 20 reports.
Chase's critical caveat — ambient ≠ autonomous — is the load-bearing design decision. Humans still gate consequential actions. The agent proposes; the human approves. Full autonomy is neither desirable nor necessary for most valuable workflows.
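The propose-then-approve loop above can be sketched as a minimal inbox data structure. This is our own illustrative sketch, not LangChain's Agent Inbox implementation; the class and field names are invented for the example:

```python
from dataclasses import dataclass, field
from enum import Enum

class Status(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"

@dataclass
class Proposal:
    """One inbox item: an action the agent wants to take, plus its reasoning."""
    action: str
    reasoning: str                 # the user-facing "why" behind the proposal
    status: Status = Status.PENDING
    payload: dict = field(default_factory=dict)

class AgentInbox:
    """A feed of pending proposals that a human reviews in batches."""
    def __init__(self):
        self.items: list[Proposal] = []

    def propose(self, action: str, reasoning: str, **payload) -> Proposal:
        # The agent never commits directly; it only enqueues a proposal.
        p = Proposal(action, reasoning, payload=payload)
        self.items.append(p)
        return p

    def pending(self) -> list[Proposal]:
        return [p for p in self.items if p.status is Status.PENDING]

    def approve(self, p: Proposal) -> None:
        p.status = Status.APPROVED

    def reject(self, p: Proposal) -> None:
        p.status = Status.REJECTED
```

The point of the shape: the agent's only write path is `propose`, and the human's review actions are one-liners, which is what makes supervising 20 agents feel like triaging email rather than holding 20 conversations.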
What we actually do with this
For new engagements in 2026, we ask a "chat vs ambient" question at the scoping stage. The signals:
| Signal | Shape of product |
|---|---|
| User wants to ask a question | Chat interface |
| User wants the AI to do work in the background | Ambient agent |
| Work has a natural trigger (event, schedule, threshold) | Ambient agent |
| Work is one-shot and discrete | Chat interface |
| Multiple pieces of work happen concurrently per user | Ambient agent + inbox UX |
| Each unit of work needs bounded human approval | Ambient agent + inbox UX |
| User is paying attention continuously | Chat interface |
The cutoff is roughly: does the user want a conversation, or do they want outcomes while they're not looking? Both are valid. They require different products.
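The cutoff compresses into a small heuristic. A sketch under our own simplifications (three boolean signals standing in for the full table; not a scoring rubric we'd apply mechanically):

```python
def product_shape(has_trigger: bool, concurrent: bool, user_watching: bool) -> str:
    """Rough scoping heuristic: does the user want a conversation,
    or outcomes while they're not looking?"""
    if concurrent:
        # Many parallel units of work per user never fit a chat window.
        return "ambient agent + inbox UX"
    if has_trigger and not user_watching:
        # A natural trigger plus an absent user is the ambient signature.
        return "ambient agent"
    # One-shot, attended, question-shaped work stays chat-shaped.
    return "chat interface"
```

The ordering matters: concurrency dominates, because even attended work stops being chat-shaped once several threads run at once.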
Applied: what we're building next on InterviewLM
InterviewLM's 2025 product was chat-shaped: a candidate joins a session and talks to an interviewer agent. That's the right shape for the interviewing job.
The 2026 roadmap has an ambient layer stacked on top: a hiring manager inbox where the agent proposes sourcing actions, schedules follow-ups, drafts feedback messages, and flags candidates for review. The manager reviews in batches, not real-time. Each proposed action has one-click approve, modify, or reject.
Architecturally this is a different system: event-driven triggers (new candidate enters pipeline, interview completes, threshold crossed), long-running LangGraph sessions (see entry #2) tied to these triggers, and an approval queue persisted in Postgres. The inbox UX is the primary interaction point. The chat interface is still there, but it's no longer the centre of the product.
Estimated ratio of eventual usage: 30% chat sessions, 70% inbox interactions. The inbox is where the leverage is.
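The event-driven wiring reduces to a small pattern: handlers subscribe to pipeline events and return proposals rather than acting. A minimal in-memory sketch (the event names are InterviewLM-flavoured but hypothetical, and a Python list stands in for the Postgres-backed approval queue):

```python
from collections import defaultdict
from typing import Callable

class EventBus:
    """Routes pipeline events (e.g. 'interview_completed') to agent handlers.
    Handlers return proposed actions; nothing executes without approval."""
    def __init__(self):
        self.handlers: dict[str, list[Callable[[dict], dict]]] = defaultdict(list)
        self.approval_queue: list[dict] = []  # stands in for the Postgres table

    def on(self, event: str, handler: Callable[[dict], dict]) -> None:
        self.handlers[event].append(handler)

    def emit(self, event: str, payload: dict) -> None:
        for handler in self.handlers[event]:
            proposal = handler(payload)  # handler proposes, never commits
            self.approval_queue.append({"event": event, **proposal})

# Example wiring: an interview finishing triggers a drafted-feedback proposal.
bus = EventBus()
bus.on("interview_completed",
       lambda p: {"action": "draft_feedback", "candidate": p["candidate_id"]})
```

In the real system each handler would be a long-running LangGraph session resumed by the trigger, but the contract is the same: events in, proposals out, queue in between.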
Design rules we're applying
From what Chase has publicly described and from our own in-progress work:
- Every ambient action is a proposal, not a commit. The agent writes to a queue; a human (or an auto-approval rule for low-risk actions) commits.
- Every inbox item has a "why". Users need to see the agent's reasoning to trust the proposal. Reasoning-trace discipline (Amodei #4) becomes user-facing.
- Revisit is first-class. Users can go back in the agent's decision history and intervene at any prior step. This is where LangGraph's checkpointing pays off.
- Low-risk actions auto-approve; high-risk actions require explicit review. The PRL framework (Amodei #1) maps directly onto per-action approval rules.
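The last rule is the one that keeps the inbox usable: per-action risk tiers decide what auto-approves. A sketch with illustrative action names and tiers of our own choosing, not InterviewLM's actual policy:

```python
# Hypothetical risk tiers per action type.
RISK = {
    "flag_candidate": "low",        # internal, easily reversible
    "schedule_followup": "medium",  # touches a real calendar
    "send_offer_email": "high",     # external and hard to undo
}

def route(action: str) -> str:
    """Low-risk actions auto-approve; everything else waits for a human."""
    tier = RISK.get(action, "high")  # unknown actions default to the safe path
    return "auto-approve" if tier == "low" else "human-review"
```

Defaulting unknown actions to human review is the load-bearing line: new agent capabilities start gated and earn auto-approval, not the reverse.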
None of these are Chase-originals — they're good product design applied to agents. His contribution is naming the pattern early enough that teams can build toward it intentionally.
The one thing to steal from this
On your next AI product decision, ask: does the user want a conversation, or do they want the outcome while they're not watching? If the honest answer is the second, you're not building a chatbot. Stop designing one. Sketch the inbox. Figure out which actions auto-approve and which require review. That conversation — not the chat UX — is the product.
Series complete — Harrison Chase on Production Agents
Five entries, one arc from weekend project to post-chat UX, one pattern per entry:
- 800-line weekend project (Oct 2022) — minimal viable abstraction
- LangGraph as runtime (Jun 2024) — design the state schema first
- Better models alone won't ship agents (2025) — five-layer production checklist
- Deep Agents (Jul 2025) — coordinator + subagents + filesystem + planning-as-tool
- Ambient Agents (2025–26) — "chat vs ambient" product cutoff; agent inbox as primary UX
Playbooks project — where we are
Four series complete covering four AI leaders from four angles on production AI:
- The Karpathy Playbook — the intellectual arc from Software 2.0 to the march of nines
- Inside Anthropic with Dario Amodei — safety floor to endgame
- Boris Cherny's Claude Code Setup — personal workflow to design philosophy
- Harrison Chase on Production Agents — abstractions to runtimes to product UX
Each series is a self-contained playbook. Read across all four and you have a working model of where production AI is heading in 2026.
Quick answers
What do I get from this cable?
You get a dated field note that explains how we handle this AI-industry workflow in real Claude Code projects.
How much time should I budget?
Typical effort is 7 min. The cable is marked intermediate.
How do I install the artifact?
This cable is guidance-only and does not ship an installable artifact.
How fresh is the guidance?
The cable is explicitly last verified on 2026-04-17, and includes source links for traceability.