Dario Amodei: "We are near the end of the exponential" (Feb 2026) — building for the endgame
Read the field note below to see how we apply this pattern in practice.
Turn this cable into a shipping system.
We help teams deploy reliable AI workflows with architecture, implementation, and hardening support.
Inside Anthropic with Dario Amodei #5: The end of the exponential
Final entry in the Amodei Playbook — from the safety floor of 2023 to the endgame framing of 2026.
The interview
"The most surprising thing has been the lack of public recognition of how close we are to the end of the exponential... we're near the end of the exponential."
— Dario Amodei on Dwarkesh Patel's podcast, February 2026
Important nuance: Amodei is not saying progress is plateauing. "End" means endgame — the top of the S-curve. Benchmarks pegged to human ability are being saturated. Models have gone from high-school-student level → college → PhD-level cognitive work, and the terminal point is expert-human performance on most cognitive tasks. Amodei puts the odds of "a country of geniuses in a datacentre" arriving this decade at around 90%.
What we heard
This is the opposite message from Karpathy's "decade of agents" framing (see The Karpathy Playbook entry #6). Same industry, same year, two thoughtful founders, reading the same capability curve, reaching opposite conclusions about timeline.
They're not actually contradicting each other. Karpathy is talking about agent reliability — the march from 90% to 99.999%, which is asymptotic no matter how capable the underlying model gets. Amodei is talking about raw cognitive capability — the distance between today's model and expert-human parity, which is compressing.
You can believe both. And if you're shipping production AI, you probably should.
What we actually do with this
We run every architectural decision against a two-question check:
- Capability question (Amodei): If raw model capability is 10× current levels in 18 months, does this architecture take advantage of that, or does it wall it off? A system that hard-codes "GPT-4-class prompting" into business logic throws away the next two years of gains.
- Reliability question (Karpathy): Does this architecture still need every nine I promised the customer, regardless of how capable the model gets? Better models don't replace retry logic, graceful degradation, cost guardrails, or traces.
Both must be answered. A system built only for the capability question collapses under production load. A system built only for the reliability question gets leapfrogged in 18 months. We aim for architectures that absorb capability gains and enforce reliability floors — model-agnostic at the top, safety-scaffolded at the bottom.
Applied: InterviewLM's capability-agnostic spine
We designed InterviewLM from day one to absorb model capability without requiring a rewrite:
- Provider abstraction: we route by task class, not by model name. "Reasoning task → use the current best reasoning model at acceptable cost." Today that's Claude Opus. When the endpoint changes, the routing layer absorbs it.
- Structured output as the contract: agents emit typed JSON, not free-text. A 10× more capable model with the same output schema drops into the same pipeline without any code changes.
- Eval graph as the gate: whenever a new model is released, we run it against the golden set before it sees production traffic. The decision to adopt is automatic — if scores improve without regressions, it ships. No engineering debate.
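The eval-gate bullet above can be sketched in a few lines. This is a minimal, hypothetical illustration of the "adopt automatically if scores improve without regressions" rule — the names (`EvalResult`, `should_adopt`, the task classes and scores) are ours, not InterviewLM's actual implementation:

```python
# Hypothetical sketch of an automatic model-adoption gate.
# Names, task classes, and scores are illustrative only.
from dataclasses import dataclass


@dataclass
class EvalResult:
    task: str
    score: float  # 0.0-1.0 on the golden set for this task class


def should_adopt(current: list[EvalResult], candidate: list[EvalResult]) -> bool:
    """Adopt the candidate model only if aggregate score improves
    and no individual task class regresses."""
    cur = {r.task: r.score for r in current}
    cand = {r.task: r.score for r in candidate}
    if cand.keys() != cur.keys():
        return False  # incomplete eval run: never auto-adopt
    no_regressions = all(cand[t] >= cur[t] for t in cur)
    improved = sum(cand.values()) > sum(cur.values())
    return no_regressions and improved


# Candidate improves one task class without regressing the other:
current = [EvalResult("reasoning", 0.82), EvalResult("extraction", 0.95)]
candidate = [EvalResult("reasoning", 0.89), EvalResult("extraction", 0.95)]
print(should_adopt(current, candidate))  # True
```

The point of the gate is that adoption is a data decision, not a debate: if a candidate regresses any task class, `should_adopt` returns False and the current model stays in production.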
The result: in the 18 months we've been running it, InterviewLM has swapped underlying models three times. Each swap took an hour of engineering work. None required a re-architecture. If Amodei is right about the next 18 months, that same spine absorbs whatever arrives.
At the same time, the reliability scaffolding (the PRL-3 gates from entry #1, the reasoning-trace discipline from entry #4) does not change when capability improves. The safeguards are model-independent; the capability layer is not.
The one thing to steal from this
Read your system's code and ask: where is the model hard-coded? Every "claude-3-5-sonnet" string in application logic is a line item you'll be migrating the day Anthropic, OpenAI, or a third party releases the next generation. Route by task class, not by model name. Build the routing layer once. Every capability gain from here is free.
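A routing layer of this shape can be as small as one dictionary. This is an assumed sketch, not a real provider SDK: the model identifiers are placeholders, and `resolve_model` is a name we invented for illustration. The point is that model strings live in exactly one table, so a model swap is a one-line change:

```python
# Route by task class, not by model name.
# Model identifiers below are placeholders, not real endpoints:
# update this one table when a new generation ships, and
# application code never changes.
TASK_ROUTES: dict[str, str] = {
    "reasoning": "best-reasoning-model",  # current frontier model
    "extraction": "fast-cheap-model",     # structured extraction at low cost
    "summarization": "fast-cheap-model",
}


def resolve_model(task_class: str) -> str:
    """Application code asks for a task class, never a model name."""
    try:
        return TASK_ROUTES[task_class]
    except KeyError:
        raise ValueError(f"unknown task class: {task_class!r}")


print(resolve_model("reasoning"))  # best-reasoning-model
```

Failing loudly on an unknown task class is deliberate: a typo in a task class should break at the routing layer, not silently fall through to a default model.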
Series complete — Inside Anthropic with Dario Amodei
Five entries, one arc from safety scaffolding to the endgame, one production pattern per entry:
- Responsible Scaling Policy (Sep 2023) — Production Risk Levels as the pre-ship gate
- Claude's Constitution — the brilliant friend test for AI advisors
- Machines of Loving Grace (Oct 2024) — the ten-year lens on architectural decisions
- The Urgency of Interpretability (Apr 2025) — observability-as-interpretability stack
- "End of the exponential" (Feb 2026) — capability-agnostic spine + reliability-agnostic scaffolding
Next in the broader Playbooks project: Boris Cherny's Claude Code Setup — the viral workflow thread broken down: parallel agents, Opus-for-everything, CLAUDE.md as postmortem log, and the /commit-push-pr pattern.
Quick answers
What do I get from this cable?
You get a dated field note that explains how we handle this AI-industry workflow in real Claude Code projects.
How much time should I budget?
Typical reading time is about 7 minutes. The cable is marked intermediate.
How do I install the artifact?
This cable is guidance-only and does not ship an installable artifact.
How fresh is the guidance?
The cable was last verified on 2026-04-17 and includes source links for traceability.