Skill: Inverted Testing Pyramid Coverage Review

Purpose

Audit a codebase’s test strategy using an inverted testing pyramid:

•Contract/E2E tests (highest value, user-visible behavior)
•Integration tests (real boundaries: DB, HTTP, queues, storage, browser APIs)
•Component/API contract tests (thin seams)
•Unit tests (only for critical invariants)

Goal: identify coverage opportunities that improve confidence per test-cost, not just increase test count.

Inputs

•Repository source code
•Existing test suites
•API specs / contracts (OpenAPI, protobuf, GraphQL, Gherkin, etc.)
•CI config and test reports (if available)
•Known critical user journeys

Optional:

•Production incident list
•Flaky test history
•Code ownership map

Review Rules

•Prefer behavior-visible and boundary tests over implementation-detail unit tests.
•Avoid mock-heavy tests when real dependencies are feasible.
•Treat contracts as authoritative. If code and contract diverge, note explicitly.
•Classify each recommendation by risk, confidence gain, and effort.
•Focus first on paths that can cause user harm, data corruption, auth/security failures, or money-impacting issues.

Process

Step 1: Inventory and classify tests

Create a complete test inventory and classify each test as:

•contract_e2e
•integration
•component_contract
•unit

For each, capture:

•file/path
•feature/flow covered
•dependencies used (real/mock)
•speed (slow/medium/fast)
•flakiness signals
•current CI stage

Step 2: Build behavior map

List top user-visible flows and system boundaries. Examples:

•auth/login/session refresh
•checkout/payment
•create/update/delete lifecycle
•async job processing
•external API integration
•permission/role checks
•failure/retry/idempotency paths

Map flows to existing tests and mark:

•fully covered
•partially covered
•not covered

Step 3: Gap analysis by pyramid level

For each flow, identify missing tests at each level:

•Missing contract/E2E behavior assertions
•Missing integration tests across real boundaries
•Missing API/component contracts
•Missing unit invariants (only where justified)

Step 4: Mock audit

Identify tests where mocks reduce confidence. Flag opportunities to replace mocks with:

•ephemeral DB/container
•local queue/storage
•fake-but-protocol-accurate test servers
•clock injection (instead of global time mocking)

Step 5: Risk-prioritized recommendations

Produce a ranked backlog with:

•recommendation
•pyramid level
•affected flow
•risk addressed
•expected confidence gain (High/Med/Low)
•effort (S/M/L)
•suggested test shape and location
•“why now”

Step 6: Minimal migration plan

Provide a 2-4 week phased plan:

•Week 1: highest risk + fastest wins
•Week 2+: deeper integration/contract hardening Include explicit de-prioritization of low-value tests.

Required Output Format

A) Coverage Scorecard

•Contract/E2E: X%
•Integration: X%
•Component/Contract: X%
•Unit: X%
•Overall confidence posture: Weak / Medium / Strong

(Use qualitative scoring if exact percentages are unavailable)

B) Behavior Coverage Map

Table format:

User Flow / Boundary	Contract/E2E	Integration	Component	Unit	Status
Login/auth	✓	✓	✓	✓	Full
Payment checkout	✗	✓	✗	✓	Partial
...	...	...	...	...	...

Include notes on critical gaps.

C) Gap Analysis

For each pyramid level, list:

•Missing contract/E2E tests: flows with zero end-to-end coverage
•Missing integration tests: real boundaries tested with mocks only
•Missing component contracts: API contracts not validated
•Unnecessary unit tests: low-value implementation-detail tests

D) Mock Audit Results

List tests that should be upgraded from mocks to real dependencies:

•Test file/path
•Current mock usage
•Suggested replacement (e.g., "ephemeral SQLite", "testcontainers Postgres", "httptest server")
•Confidence gain

E) Prioritized Recommendations

Ranked table:

#	Recommendation	Level	Flow	Risk	Confidence	Effort	Why Now
1	Add E2E checkout test	Contract	Payment	High	High	M	Money-critical
2	Replace DB mocks with ephemeral DB	Integration	CRUD	Med	High	S	Remove flakiness
...	...	...	...	...	...	...	...

F) Migration Plan

Week 1: Critical gaps + quick wins

• Recommendation #1
• Recommendation #2
• Recommendation #3

Week 2-3: Integration hardening

• Replace major mocks
• Add boundary tests

Week 4: Contract validation

• API contract tests
• Schema validation

De-prioritize:

•Unit tests for simple getters/setters
•Tests coupling to internal implementation
•Duplicate coverage

Usage Example

bash

# Run the skill
@testing-skill analyze codebase for testing coverage gaps

# The output will follow the A-F format above and can be used to:
# - Justify testing investment
# - Guide sprint planning
# - Onboard new developers to testing strategy
# - Track progress over time

Success Metrics

•Reduced production incidents from untested paths
•Faster, more confident deployments
•Decreased flaky test rate
•Improved developer confidence in refactoring