Advanced Code Review

Use this skill when a review needs to go beyond style feedback and surface the real engineering risks in a change. It is built for complex pull requests, sensitive code paths, test automation suites, and refactors where a shallow review would miss important trade-offs.

When to Use

Use this skill when the user asks for things like:

•"review this PR critically"
•"audit this change for maintainability or security"
•"look for architectural issues, not just syntax problems"
•"review this test automation code for flakiness and long-term cost"
•"analyze whether this refactor is safe to merge"

Typical scenarios:

•large or multi-file pull requests
•legacy code audits
•high-risk changes in authentication, payments, permissions, data flows, or CI pipelines
•changes that add tests but may still reduce confidence
•code that technically works but appears expensive to maintain

Review Principles

•Evidence over intuition - every important finding should point to observable code, behavior, or missing coverage
•Risk over volume - a few high-signal findings are better than a long list of cosmetic remarks
•Context before judgment - understand intent, constraints, and surrounding patterns before recommending change
•Severity must mean something - blockers and high-risk findings must be clearly separated from notes and polish
•Actionable feedback only - explain why the issue matters and what the safer direction looks like
•Acknowledge uncertainty - if the review lacks context, say so explicitly instead of pretending certainty

Review Workflow

Follow the phases in order.

Phase 0: Frame the review

Before reviewing, establish:

•scope: whole repository, changed files only, or one subsystem
•review goal: correctness, readiness, architecture, security, test quality, or debt discovery
•change type: feature, bug fix, refactor, migration, test-only, or infrastructure
•constraints: release pressure, backward compatibility, legacy boundaries, compliance needs

If the user does not provide this information, infer cautiously and list assumptions.

Phase 1: Understand the context first

Do not comment on code in isolation if context is available.

Check:

•surrounding modules and nearby patterns
•existing tests and how they express behavior
•documentation, ticket intent, or change summary if available
•whether the implementation matches the apparent architectural direction of the repository

If the request is based on a diff or pull request, identify:

•what behavior changed
•what behavior stayed implicit
•what could regress without detection

Phase 2: Inspect the change through multiple lenses

Use ./resources/review-checklist.md as the primary checklist.

Always review at least these dimensions:

•Correctness - logic, edge cases, state transitions, and error handling
•Maintainability - readability, cohesion, duplication, coupling, and naming quality
•Architecture - boundary violations, abstraction quality, hidden dependencies, and future change cost
•Security and privacy - trust boundaries, secrets, authorization, unsafe defaults, and logging risks
•Performance and scalability - obvious hot paths, unnecessary work, poor batching, blocking calls, or repeated setup
•Testing - missing coverage, weak assertions, test realism, determinism, and regression detection strength
•Operability - observability, diagnostics, fallback behavior, and failure recovery

When the reviewed code touches automated tests, also load ./resources/test-automation-review-lens.md.

Phase 3: Build findings with clear structure

Each meaningful finding should contain:

•severity: Blocker, High, Medium, Low, or Note
•area: correctness, architecture, test quality, security, performance, maintainability, or operability
•summary: short statement of the issue
•evidence: concrete file, behavior, or omission that supports the claim
•risk: why the issue matters
•recommended direction: the smallest safe improvement or follow-up

Do not inflate severity.

Use these severity rules:

•Blocker - likely wrong, unsafe, or merge-blocking
•High - significant regression, security, data, or reliability risk
•Medium - important quality issue that should be addressed soon
•Low - valid improvement but not urgent
•Note - observation, trade-off, or follow-up idea with no immediate action required

Phase 4: Write a review that is easy to act on

Use ./resources/review-report-template.md for the response structure.

The review should usually include:

•a short executive summary
•the highest-risk findings first
•specific strengths worth preserving
•missing tests, docs, or operational safeguards
•a final recommendation: approve, comment, or request changes

Keep feedback concrete and teammate-friendly. Prefer "Consider extracting this dependency boundary because..." over vague criticism like "This feels messy."

Phase 5: Self-check before finalizing

Before delivering the review, verify:

•the summary matches the actual findings
•duplicated comments are removed
•every non-trivial finding has evidence
•severity matches impact
•suggested fixes are proportionate to the problem
•the review distinguishes between immediate blockers and longer-term design advice

Advanced Review Heuristics

Signs a change is riskier than it first appears

•a refactor changes control flow and test snapshots still pass without stronger assertions
•the implementation introduces new branching but test coverage stays flat
•a helper reduces duplication but centralizes too many unrelated responsibilities
•the code hides I/O, retries, or state mutations behind innocent-looking abstractions
•logging or telemetry was changed in a way that weakens failure diagnosis
•the diff is small but touches a critical boundary such as auth, persistence, or test fixtures

Good advanced review behavior

•call out what is solid, not only what is wrong
•distinguish merge-now issues from roadmap-quality concerns
•connect design comments to business or delivery risk
•mention when more evidence is needed rather than over-claiming
•keep suggestions incremental unless the current approach is structurally unsafe

Bad review behavior to avoid

•nit-picking style while missing correctness or reliability risk
•calling something "bad practice" without explaining the consequence
•recommending rewrites when a targeted fix is enough
•marking everything high severity
•treating test code as lower quality or less deserving of design review

Output Expectations

The preferred output is a structured markdown review using the template in ./resources/review-report-template.md.

If the user asks for a shorter response, compress the output but preserve:

•high-level verdict
•severity-ranked findings
•evidence for important claims
•recommendation status

If the user asks for an example of the expected result, use ./resources/example-review.md.

Resource Map

•./resources/review-checklist.md - multi-lens checklist for advanced reviews
•./resources/review-report-template.md - reusable review output structure
•./resources/example-review.md - example filled review with severity-ranked findings
•./resources/test-automation-review-lens.md - extra review criteria for test suites and automation code