Documentation Audit Skill

Purpose

Audit and repair markdown documentation in codebases to ensure documentation claims match actual code behavior while maintaining documentation structure and completeness.

Success Criteria

•Accuracy: All documentation claims verified against code
•Navigation: Clear structure with functioning cross-links and README files where needed
•Currency: Obsolete documentation archived or removed
•Completeness: All significant features documented

Scope

Included:

•Markdown documentation files (*.md)
•Text-based diagrams (Mermaid, PlantUML, Graphviz, ASCII)
•Documentation structure and navigation

Excluded:

•Code comments and language-specific doc comments (godoc, JSDoc, etc.)
•Image-based diagrams
•Point-in-time documents (docs/plans/*)

When to Use This Skill

Use this skill when:

•Auditing documentation accuracy after significant code changes
•Onboarding to a new codebase and discovering documentation gaps
•Preparing for releases to ensure documentation is current
•Maintaining documentation as part of regular code health practices

Architecture: Multi-Pass Progressive Refinement

The skill operates in four passes, each building on the previous.

Pass 0: Repository Indexing

Build a comprehensive index of the repository before verification begins.

Create TodoWrite tasks for:

•Extract symbol graph using symbolic code analysis tools
•Identify API contracts from canonical sources (OpenAPI, protobuf, GraphQL schemas, configuration files)
•Map documentation cross-links and navigation files
•Index ownership metadata and last modification dates

Activities:

•Use mcp__serena__find_symbol to extract exported symbols from each package
•Use Glob to find canonical sources: **/*.proto, **/openapi.yaml, **/*.graphql, config schemas
•Use Grep to map documentation links (search for [.*](.*\.md) patterns)
•Note: Keep this pass lightweight - index only what's needed for verification

Output: Notes on repository structure, canonical sources found, and documentation topology.

Pass 1: Discovery and Classification

Discover all markdown files and classify them by purpose and lifecycle.

Create TodoWrite tasks for:

•Find all markdown files recursively
•Classify documents by type and lifecycle
•Identify obsolete candidates for archival

Activities:

•Use Glob with pattern **/*.md to find all documentation
•
Classify using multiple signals:
- •Path heuristics: docs/archive/*, files with dates in names
- •Commit recency: Use Bash with git log --format=%ci --max-count=1 -- <file> to check last update
- •Link graph centrality: Documents frequently linked from others are likely living docs
- •YAML frontmatter: Look for status: living|archival|planned
•
Apply decision rules:
- •docs/plans/* are point-in-time (never updated)
- •High recency + clear structure = living documentation
- •Archived paths + low recency = obsolete candidate

Output: Classified document inventory with assessment of which docs to audit vs archive.

Safety: Move obsolete documents to docs/archive/ rather than delete. Ask user before archiving documents with ambiguous classifications.

Pass 2: Claim Extraction and Investigation Planning

Extract verifiable claims from living documentation and plan verification strategy.

Create TodoWrite tasks for each document section being validated.

Activities:

•
For each living documentation file:
- •Use Read to load the document
- •
  Extract claims by type:
  - •Behavioral: "Service retries three times"
  - •Structural: "Class implements Interface"
  - •API: "Endpoint returns JSON with schema X"
  - •Configuration: "Setting defaults to value Y"
  - •Usage: "Run command with flag --foo"
•Record claim metadata: doc_path, line_range, type, referenced_symbols
•
Identify text-based diagrams and classify:
- •Normative: Must match code structure exactly (UML class diagrams)
- •Illustrative: Conceptual, verify broad relationships only
•
Analyze codebase structure for documentation gaps:
- •Packages/modules without documentation
- •Exported APIs without usage examples
- •Directories lacking README files
•Build investigation plan grouping claims by symbol/module for batch efficiency

Output: List of claims to verify, organized by module/package for efficient verification.

Optimization: Group related claims together to share context during verification.

Pass 3: Verification and Investigation

Verify claims against code using risk-appropriate methods.

Create TodoWrite tasks for each investigation batch.

Verification Hierarchy (use highest confidence method available):

•Canonical sources: Verify against OpenAPI specs, protobuf definitions, GraphQL schemas, configuration schemas
•Symbolic analysis: Verify structural claims using mcp__serena__find_symbol, mcp__serena__find_referencing_symbols, mcp__serena__search_for_pattern
•Deep investigation: Use mcp__zen__analyze, mcp__zen__debug, or mcp__zen__thinkdeep for complex behavioral claims

Activities:

•Process investigation batches from Pass 2
•
For each claim:
- •Select verification method based on claim type and available evidence
- •Start with symbolic analysis to narrow scope before deep investigation
- •Document findings with evidence trails (file paths, line numbers, symbol names)
- •Record whether claim is verified, contradicted, or requires user review
•
Verify diagrams:
- •Parse diagram syntax to extract assertions about code structure
- •Compare against symbol graph (normative) or verify broad structure (illustrative)
•
Create documentation for identified gaps:
- •Missing README files in documented directories
- •Undocumented features found in Pass 0 indexing
•
Handle conflicts:
- •Apply authority hierarchy: canonical sources > generated docs > top-level README > service READMEs
- •Record conflicts with context (version, environment, feature flags)
- •Annotate rather than delete when conflicts may be contextual

Output: Verified claims with corrections noted, new documentation drafts for gaps, conflict records.

Risk Management:

•Assign confidence to all verifications (exploring, low, medium, high, very_high, almost_certain, certain)
•Flag low-confidence verifications for user review
•Prefer static verification over runtime checks (safer, more deterministic)
•Never auto-fix based on weak evidence

Pass 4: Risk-Tiered Repair and Reporting

Apply corrections based on risk assessment.

Risk Tiers:

Auto-fix (apply immediately):

•Broken internal links (update paths)
•Typos in code references (symbol renamed in codebase)
•Outdated paths (files moved)
•Missing table of contents
•Diagram syntax errors

User approval required (present for review):

•Substantive technical corrections (behavior claims)
•Claim deletions or rewrites
•Document reclassifications
•Structural changes
•Conflict resolutions

User review required (flag but don't auto-fix):

•Low-confidence verifications
•Conflicting claims across documents
•Planned features (add disclaimers, don't verify behavior)
•Destructive actions (archival, removal)

Activities:

•Apply auto-fixes immediately using Edit or Write tools
•
For user-approval changes:
- •Present diff preview
- •Show evidence trails (code spans, symbols verified, canonical sources)
- •Include confidence scores and rationale
- •Wait for user approval before applying
•Move obsolete documents to docs/archive/ (after approval)
•Create new README files for undocumented directories
•Insert gap documentation using Write or Edit
•Update navigation indexes
•
Generate summary report:
- •Changes by category and risk tier
- •Coverage metrics (claims verified / total claims)
- •Remaining manual review items
- •Confidence distribution

Output: Applied fixes, pending changes for review, archived documents, comprehensive summary.

Authority Hierarchy

When multiple documents make conflicting claims:

•Canonical sources (OpenAPI, protobuf, schemas)
•Generated documentation (godoc, JSDoc)
•Top-level README.md
•Service/module READMEs
•Design documents
•Ad-hoc notes

Document Classification Heuristics

Living Documentation (keep current):

•High commit frequency (updated within last 6 months)
•Clear ownership or maintenance signals
•Linked from README or navigation
•Path patterns: README.md, CONTRIBUTING.md, ARCHITECTURE.md, top-level docs

Point-in-Time (never update):

•Under docs/plans/
•Contains specific dates or version numbers
•Marked with status: plan or status: archival
•Low change frequency (<2 updates ever)

Obsolete (archive):

•No updates in 12+ months
•References removed code
•Superseded by newer documentation
•Marked with DRAFT or TODO

Skill Parameters

Accept these parameters when invoked:

•dry_run (default: true): Generate report without applying changes
•risk_tier (default: user_approval): Maximum risk tier to auto-apply (auto_fix | user_approval)
•verification_method (default: auto): Force specific verification approach (auto | canonical | symbolic | deep)
•focus_paths (default: all): Restrict to specific directories (e.g., "docs/", "README.md")
•skip_gap_analysis (default: false): Skip documentation gap detection

Usage Workflow

Step 1: Announce and Initialize

code

"I'm using the documentation-audit skill to verify documentation accuracy."

Create TodoWrite tasks for all four passes plus a completion task.

Step 2: Execute Pass 0 - Repository Indexing

Follow Pass 0 activities above. Keep this lightweight - just enough to understand repository structure and find canonical sources.

Step 3: Execute Pass 1 - Discovery and Classification

Follow Pass 1 activities above. Classify all markdown files and identify obsolete candidates.

Step 4: Execute Pass 2 - Claim Extraction

Follow Pass 2 activities above. Extract all verifiable claims and create investigation plan.

Step 5: Execute Pass 3 - Verification

Follow Pass 3 activities above. Verify each claim using appropriate verification hierarchy.

Step 6: Execute Pass 4 - Repair

Follow Pass 4 activities above. Apply fixes according to risk tiers and present changes for review.

Step 7: Summary

Provide conversational summary:

•Total claims verified
•Auto-fixes applied
•Changes requiring approval
•Items requiring manual review
•Overall documentation health assessment

Tools Available

•Serena symbolic tools: mcp__serena__find_symbol, mcp__serena__find_referencing_symbols, mcp__serena__search_for_pattern, mcp__serena__get_symbols_overview
•Zen investigation tools: mcp__zen__analyze, mcp__zen__debug, mcp__zen__thinkdeep, mcp__zen__codereview
•File operations: Read, Write, Edit, Glob, Grep
•Version control: Bash with git commands for commit history

Token Efficiency Strategies

•Index once, reuse: Pass 0 creates repository knowledge for later passes
•Batch similar claims: Group by symbol/module to share context
•Retrieval-augmented: Fetch only relevant code spans for verification
•Prefer static: Use symbolic analysis before deep investigation
•Focused reading: Use mcp__serena__get_symbols_overview before reading full files

Safety Mechanisms

•Evidence trails: Every change documents source claim, verification method, evidence
•Confidence scoring: Low confidence blocks auto-fixes
•Diff previews: Show changes before applying
•Archival over deletion: Move obsolete docs, don't delete
•Risk tiers: Separate auto-fixable issues from those requiring approval

Edge Cases

Feature flags: Claims may be contextual (true when flag enabled). Record context, don't mark as contradicted.

Multiple versions: Documentation for different versions may coexist. Partition claims by version.

Planned features: Add disclaimer banners, mark verification: not_applicable. Don't attempt behavioral verification.

External dependencies: Claims about external APIs may be stale. Verify against canonical sources if available, otherwise flag for review.

Runtime behavior: Claims requiring execution (performance, flakiness) are fragile. Prefer symbolic verification or mark for manual testing.

Example Execution

code

User: Run documentation audit on docs/ with auto-fixes enabled

Claude:
1. "I'm using the documentation-audit skill to verify documentation accuracy."
2. Creates TodoWrite for 4 passes + summary
3. Pass 0: Indexes repository (symbols, canonical sources, doc links)
4. Pass 1: Finds 23 markdown files, classifies 3 as obsolete candidates
5. Pass 2: Extracts 156 claims from 20 living docs, groups into 12 investigation batches
6. Pass 3: Verifies claims using symbolic analysis (78%), canonical sources (12%), deep investigation (10%)
   - Finds 8 contradicted claims, 12 documentation gaps
7. Pass 4:
   - Auto-fixes: 3 broken links, 1 outdated path
   - Presents for approval: 8 technical corrections, 12 new documentation sections
   - Flags for review: 2 low-confidence verifications
8. Summary: "Verified 156 claims with 95% confidence. Applied 4 auto-fixes, 20 changes await approval, 2 require manual review."

Anti-Patterns

❌ Don't skip symbolic analysis - Use serena tools before reading full files ❌ Don't auto-fix low-confidence findings - Flag for user review instead ❌ Don't delete documents - Archive to docs/archive/ instead ❌ Don't batch all changes - Apply auto-fixes immediately, present substantive changes for review ❌ Don't verify point-in-time documents - Leave docs/plans/* alone ❌ Don't skip TodoWrite - Track progress through all four passes

Summary

This skill performs rigorous documentation audits through four systematic passes:

•Index repository structure and canonical sources
•Classify documents by lifecycle and purpose
•Extract and group verifiable claims
•Verify claims and apply risk-tiered repairs

90% of effort is verification against code, 10% is structure and quality.