Karpathy's vibe coding (Feb 2025): and why "mostly works" is not a production SLA
Read the field note below to see how we apply this pattern in real Claude Code projects.
The Karpathy Playbook #4: Vibe coding
Part 4 of 6. Tracing Karpathy's public thinking with FRE|Nxt Labs production commentary.
The tweet
"There's a new kind of coding I call 'vibe coding', where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. It's possible because the LLMs (e.g. Cursor Composer w Sonnet) are getting too good. Also I just talk to Composer with SuperWhisper so it's not even coding, I just speak."
@karpathy, 2 February 2025
One year later, in his retrospective, he called it "a shower of thoughts throwaway tweet". And noted he still can't predict tweet engagement after 17 years on the platform. The throwaway tweet became a generational label.
What we heard
Karpathy wasn't advocating sloppiness. He was describing what happens when the friction of writing code drops below the friction of thinking about code. At that threshold, you iterate at the speed of ideas.
The trap is stopping there. "Mostly works" is not a production SLA. Every production system we've shipped has a vibe-coding origin story and a non-vibe production-gate ending.
What we actually do with this
We split every AI-assisted build into two explicit phases:
Phase 1. Vibe phase (intentionally fast) We do exactly what Karpathy describes. We use Claude Code to scaffold, generate, iterate at idea speed. We don't review individual lines. We run it and look at output.
Phase 2. Production gate (intentionally slow) Before any output leaves our environment we run:
npm run build. Type errors are load-bearingnpm run lint. Catches the copy-paste artifacts AI loves to leave- A smoke test against the real API (not a mock)
- A cost estimate: tokens-in × token price at expected QPS
The vibe phase might take 20 minutes. The production gate takes 40 minutes. The ratio is the point. We're not abandoning rigour, we're front-loading creativity and back-loading verification.
The concrete result
On the InterviewLM engagement, we used this pattern to prototype eight distinct agent personas in two days. Vibe phase shipped the scaffolding. Production gate caught the three that had prompt-injection surface areas. Total time to production-ready: six days.
Without the vibe phase: probably three weeks of careful upfront design. Without the production gate: three personas with live security issues.
The one thing to steal from this
Add a pre-commit hook that runs your build + lint. It forces you to exit the vibe phase before you ship. Claude Code with Turbopack makes this nearly zero latency. There is no longer a meaningful cost to running a build.
Next in this series
#5. Software 3.0 (June 2025). Four months after "vibe coding," Karpathy gave it a formal name at YC AI Startup School: "The hottest new programming language is English." What that reframing unlocks.
Quick answers
What do I get from this cable?
You get a dated field note that explains how we handle this ai-industry workflow in real Claude Code projects.
How much time should I budget?
Typical effort is 5 min. The cable is marked beginner.
How do I install the artifact?
This cable is guidance-only and does not ship an installable artifact.
How fresh is the guidance?
The cable is explicitly last verified on 2026-04-17, and includes source links for traceability.
Work with FRE|Nxt
We build the production AI systems we write about.
Cables are the field notes. The playbooks come from client engagements — multi-agent systems, RAG pipelines, and LLM cost cuts that ship and hold up in production. If something here maps to a problem on your roadmap, two ways in:
Audit capacity: 5 slots/month · No pitch deck · NDA on request
Use auto mode, not --dangerously-skip-permissions
Two flags promise to stop Claude Code from pausing at every tool call. One of them reads your settings, honours your allowlist, and refuses to run anything g…
Publish your stack to Cables (automated)
A skill that walks Claude Code through publishing your Claude stack to the Cables community in one conversation. No manual repo setup, no hand-written `stack…
Replicate Ragav's stack (skills + plugins + scripts)
Pick the stack that matches what you're building. Each one is a single `npx` command. Plugins installed, skills synced, marketplaces configured, no bash scri…