Audit your E2E coverage
Install a skill or read the field note below to see how we apply this pattern in real Claude Code projects.
We had tests. We also had regressions. The problem was coverage shape, not raw test count. Run this command to install a skill and start from a working baseline instead of rebuilding the setup from scratch.
Files this command writes (1 file)
.claude/skills/e2e-review/SKILL.md← artifact/SKILL.md
Run this skill from your repository root so test paths and docs resolve correctly.
Audit your E2E coverage
We had tests. We also had regressions. The problem was coverage shape, not raw test count.
What we tried
We used an E2E review skill to inventory user-critical flows and map each flow to an existing test. The prompt we gave Claude Code was specific:
Read every route file and every page component, then read every test file. Map each test to the flow it covers. Return a table: flow name, test exists (yes/no), test file path.
We pointed it at src/app/ for routes and tests/e2e/ for test files, and asked it to treat each top-level route as a potential flow entry point.
What happened
The skill exposed blind spots quickly:
- Flows with no test at all
- Flows with setup assertions but missing completion assertions
- Flows that existed only in manual QA notes
On the first pass, Claude Code made some confident mistakes. It matched tests to routes by filename similarity. A file named checkout.spec.ts was counted as covering the /checkout route even though the test only validated cart state, not payment completion. It also counted beforeEach and afterAll setup blocks as tests, inflating the coverage numbers by about 15%. We corrected both by telling it to match on explicit test() or it() calls only, and to check that each test included an assertion against a post-flow state (a confirmation message, a redirect, a database side effect) rather than just reaching the flow's entry screen.
What we learned
- Measure coverage by user journeys, not file count. A project can have 200 test files and still be missing the three flows that matter most.
- Every critical flow should have: enter flow, complete flow, and confirm side effects. Steps without a completion assertion are scaffolding, not coverage.
- Keep one markdown coverage matrix in-repo so regression gaps are visible during PR review. We named ours
docs/e2e-coverage.md. A single table of flow, status, and test file path that lives next to the code. - Re-run the audit when you add new routes or pages, not on a fixed schedule. A monthly reminder to audit often misses the gap introduced by a new feature shipped in week two. Trigger the audit from your definition-of-done checklist for any PR that adds a route.
- Work the gap list in order of user impact, not technical complexity. A missing test for the signup confirmation screen matters more than a missing test for the admin settings page, even if the settings test is easier to write. Sort by how many users hit the flow per week.
When this doesn't fit
- Projects without a defined routing layer: if routes aren't in a single directory or config file, Claude Code's audit will miss flows. You need a route manifest first.
- Generated test files: coverage audit assumes test files reflect intentional coverage. Auto-generated snapshots or fixture files will inflate results.
- Teams without user journey definitions: the audit maps tests to flows, but if flows aren't documented anywhere, Claude Code invents its own groupings. Define your top 10 journeys first.
Next
- Next. Fix your prompt caching.
Quick answers
What do I get from this cable?
You get a skill plus a dated field note that explains how we use it in real Claude Code workflows.
How much time should I budget?
Typical effort is 20 min. The cable is marked intermediate.
How do I install the artifact?
Run npx frenxt-cables add e2e-coverage-audit. The install block shows the files it writes and any prerequisites before you run it.
How fresh is the guidance?
The cable is explicitly last verified on 2026-04-15, and includes source links for traceability.
Work with FRE|Nxt
We build the production AI systems we write about.
Cables are the field notes. The playbooks come from client engagements — multi-agent systems, RAG pipelines, and LLM cost cuts that ship and hold up in production. If something here maps to a problem on your roadmap, two ways in:
Audit capacity: 5 slots/month · No pitch deck · NDA on request
Use auto mode, not --dangerously-skip-permissions
Two flags promise to stop Claude Code from pausing at every tool call. One of them reads your settings, honours your allowlist, and refuses to run anything g…
Publish your stack to Cables (automated)
A skill that walks Claude Code through publishing your Claude stack to the Cables community in one conversation. No manual repo setup, no hand-written `stack…
Replicate Ragav's stack (skills + plugins + scripts)
Pick the stack that matches what you're building. Each one is a single `npx` command. Plugins installed, skills synced, marketplaces configured, no bash scri…