Brownfield Project Documentation Generator

Generate comprehensive architecture documentation automatically by analyzing existing codebase structure, patterns, and conventions.

Purpose

Analyze an existing codebase and generate three comprehensive documentation files:

•architecture.md - Project structure, tech stack, data models, API specifications
•standards.md - Coding standards, best practices, discovered conventions
•patterns.md - Design patterns, architectural patterns, common conventions

This enables BMAD Enhanced to work with brownfield projects by reverse-engineering architecture from code.

When to Use This Skill

This skill should be used when:

•Starting BMAD Enhanced with existing project (brownfield onboarding)
•Architecture documentation is outdated or missing
•Need to discover implicit patterns and conventions
•Onboarding to unfamiliar codebase

This skill should NOT be used when:

•Greenfield projects (write docs from scratch instead)
•Project already has current, comprehensive documentation
•Codebase >500K lines (too complex for automated analysis)
•Project has no clear structure

Project Type Support

Well-Supported Languages:

•✅ Node.js/TypeScript (excellent support)
•✅ Python (good support)
•✅ Go (good support)
•✅ Java/Kotlin (good support)
•✅ Rust (good support)

Basic Support:

•⚠️ PHP, Ruby, C#/.NET (basic support)

Optimal Codebase Size: 10K-100K lines

•Smaller: May lack sufficient patterns to analyze
•Larger: Analysis may be too slow or complex

Sequential Documentation Generation

Execute steps in order - each builds on previous analysis:

Step 0: Configuration and Validation

Purpose: Verify project is suitable for automated documentation.

Actions:

•Load config from .claude/config.yaml (codebasePath, existingDocs, includeTests, maxFiles)
•Validate structure: Check path exists, identify languages, count files/lines, verify size (10K-100K recommended)
•Check existing docs: Ask how to handle (merge/replace/supplement)
•Get confirmation: Show summary (path, language, file count, lines, estimated time), ask to proceed

See: references/templates.md#step-0-configuration-and-validation-output for complete formats

Halt if:

•Codebase path not found
•No recognizable project structure
•Codebase too large (>500K lines)
•Unsupported language
•User declines to proceed

Output: Validation confirmation (project type, lines, existing docs mode, scope, ready status)

Reference: See validation-criteria.md for detailed validation rules.

Step 1: Analyze Project Structure

Purpose: Map file organization and module structure.

Actions:

•Scan directory structure: Identify main dirs, detect patterns (feature vs type), note nesting
•Analyze file organization: Files per dir, naming conventions, size distribution, co-location patterns
•Detect project type: Backend API / Frontend App / Full-Stack / Library / Monorepo (based on directory structure)
•Map relationships: Analyze imports, build dependency graph, identify core modules, detect circular dependencies

Output: Project structure analysis with type, organization, directory structure (dirs + file counts + line counts), key patterns

See: references/templates.md#step-1-codebase-analysis-output for complete format

Reference: See analysis-techniques.md for detailed analysis methods.

Step 2: Analyze Technology Stack

Purpose: Identify languages, frameworks, and dependencies.

Actions:

•
Read package configuration:
- •Node.js: package.json
- •Python: requirements.txt, pyproject.toml
- •Go: go.mod
- •Java: pom.xml, build.gradle
- •Rust: Cargo.toml
•
Extract dependencies and identify frameworks:
- •Backend: Express, Django, Spring Boot, etc.
- •Frontend: React, Vue, Angular, etc.
- •Database: Prisma, TypeORM, SQLAlchemy, etc.
- •Testing: Jest, Pytest, JUnit, etc.
•
Detect runtime/platform:
- •Node.js/Python/JDK version
- •Database type (PostgreSQL, MySQL, MongoDB)

Output: Tech stack summary (runtime, backend/frontend frameworks, key libraries, testing tools, versions, confidence score)

See: references/templates.md#step-2-technology-stack-analysis for complete format

Step 3: Extract Data Models and Schemas

Purpose: Document data structures and validation rules.

Actions:

•
Locate data model files:
- •Prisma schema: prisma/schema.prisma
- •TypeScript interfaces: src/types/*.ts, src/models/*.ts
- •Database migrations: prisma/migrations/, migrations/
- •Validation schemas: Zod, Yup, Joi schemas
•
Parse data models and extract validation rules
•
Analyze relationships:
- •One-to-many, many-to-many
- •Foreign keys and constraints
•
Detect data flow:
- •Request → Validation → Service → Repository → Database
- •Response transformation (DTOs)

Output: Data models summary (models with fields/types/constraints, validation rules, relationships, confidence score)

See: references/templates.md#complete-architecture-md-template for data model format

Reference: See analysis-techniques.md for model extraction patterns.

Step 4: Analyze API Patterns

Purpose: Document API structure and conventions.

Actions:

•
Locate API definitions:
- •Express routes: src/routes/**/*.ts
- •Controllers: src/controllers/**/*.ts
- •OpenAPI/Swagger spec (if exists)
- •GraphQL schemas (if exists)
•
Extract endpoints and analyze request/response patterns:
- •Request validation (middleware)
- •Error response format
- •Success response format
- •Status codes used
•
Identify authentication:
- •JWT tokens, session-based, API keys, OAuth
•
Detect rate limiting and other middleware

Output: API specs summary (base URL, authentication type/headers/expiry, error/success response formats, confidence score)

See: references/templates.md#complete-architecture-md-template for API specification format

Step 5: Extract Coding Standards and Patterns

Purpose: Document implicit conventions and best practices.

Actions:

•
Analyze code style:
- •Read .eslintrc, .prettierrc, tsconfig.json
- •Detect: indentation, quotes, semicolons
- •Naming conventions: variables, functions, classes, files
•
Identify architectural patterns:
- •Design patterns: Repository, Factory, Strategy
- •Architectural style: Layered, Clean, Hexagonal
•
Extract error handling patterns:
- •Error classes
- •Centralized error handling
- •Logging patterns
•
Detect testing patterns:
- •Test organization
- •Mocking strategy
- •Test data management

Output:

code

Coding Standards:

Code Style:
- Indentation: 2 spaces
- Quotes: Single quotes
- Naming: camelCase (variables), PascalCase (classes), kebab-case (files)

Architectural Patterns:
1. Layered Architecture
   - Routes (presentation layer)
   - Services (business logic layer)
   - Repositories (data access layer)

2. Dependency Injection
3. Repository Pattern

Error Handling:
- Custom error classes (AppError, ValidationError)
- Centralized error handling middleware
- Never expose stack traces to clients

Confidence: High (90%)

Reference: See pattern-detection.md for pattern identification techniques.

Step 6: Generate Architecture Documentation

Purpose: Create comprehensive architecture.md from findings.

Actions:

•
Load architecture template structure
•
Populate sections with analyzed data from Steps 1-5:
- •Overview, tech stack, project structure
- •Data models, API specifications
- •Include confidence scores where applicable
- •Add source file references
•
Generate diagrams (optional):
- •Mermaid diagrams for architecture layers
- •Data model ERD
- •API endpoint map
•
Add human review notes for medium/low confidence sections
•
Write documentation file:
- •Create docs/architecture.md
- •Or merge with existing docs if "merge" mode selected

Output:

code

✓ Architecture documentation generated
✓ File: docs/architecture.md (2,450 lines)
✓ Sections: 12
✓ Overall confidence: 85%
✓ Human review items: 5

Reference: See documentation-templates.md for template structures.

Step 7: Generate Standards Documentation

Purpose: Create standards.md from discovered patterns.

Actions:

•
Extract standards from code analysis:
- •Security standards (from security practices)
- •Testing standards (from test patterns)
- •Code quality standards (from ESLint rules)
- •Performance standards (from observed patterns)
•
Document best practices:
- •Observed consistently across codebase
- •Mark as "discovered" vs "recommended"
•
Create standards document with examples and consistency scores

Output:

code

✓ Standards documentation generated
✓ File: docs/standards.md (850 lines)
✓ Standards extracted: 18
✓ Consistency scores: 75-100%

Step 8: Generate Patterns Documentation

Purpose: Document discovered design patterns and conventions.

Actions:

•
Extract design patterns:
- •Repository, Factory, Strategy, Middleware patterns
•
Document usage examples:
- •Show code examples of each pattern
- •Explain when to use
- •Link to existing implementations
•
Identify anti-patterns:
- •Code smells detected
- •Inconsistencies
- •Technical debt
•
Create patterns document

Output:

code

✓ Patterns documentation generated
✓ File: docs/patterns.md (620 lines)
✓ Patterns identified: 8
✓ Anti-patterns noted: 3

Reference: See pattern-detection.md for pattern documentation templates.

Step 9: Validation and Confidence Scoring

Purpose: Score accuracy and identify areas needing human review.

Actions:

•Calculate confidence: Score each section (tech stack, structure, data models, API, standards) and compute overall
•Identify low-confidence areas: Find sections <70%, conflicting patterns, missing info
•Generate review checklist: List high/medium priority items needing human verification
•Create validation report

Output: Validation summary (overall confidence %, high/medium priority review item counts)

See: references/templates.md#review-checklist-template and references/confidence-scoring.md for formats

Reference: See confidence-scoring.md for scoring guidelines.

Step 10: Summary and Next Steps

Purpose: Provide user with clear summary and action items.

Actions:

•Generate summary report: Project name, duration, files analyzed, confidence, generated docs (paths + line counts), key findings (type, arch, tech stack, test coverage), next steps
•Create review checklist file: docs/REVIEW_CHECKLIST.md
•Update config: Set brownfield flag, doc paths, documented=true
•Prompt next action: Review items, create task spec, run index-docs, or exit

Output: Summary confirmation (report generated, checklist created, config updated, ready status)

See: references/templates.md for complete summary formats

Confidence Scoring Guidelines

High (85-100%): Explicit in code/config, consistent patterns, no conflicts | Medium (70-84%): Inferred from patterns, some inconsistencies, needs validation | Low (<70%): Missing/unclear info, conflicts, high uncertainty, MUST review

See: references/confidence-scoring.md for detailed scoring methodology

Limitations

Cannot: Understand business logic without context, document deployment infrastructure, capture tribal knowledge, understand legacy decisions, document external integrations perfectly

Requires: Readable structured codebase, standard organization, supported language/framework, 10K-100K lines (optimal)

Best Practices

Run periodically (every 3-6 months) | Always review low-confidence sections | Supplement with manual docs (business context, deployment) | Use as starting point, enhance with team input

Integration with Planning Workflow

Brownfield: document-project → index-docs → create-task-spec (use generated docs) | Greenfield: Write docs manually first | Brownfield: Generate from code, then refine

References

Detailed documentation in references/:

•templates.md: All output formats, complete architecture/standards/patterns templates, analysis summaries, review checklists, error templates
•analysis-techniques.md: Analysis methods for structure, tech stack, models, APIs, patterns
•pattern-detection.md: Pattern identification and documentation techniques
•documentation-templates.md: Templates for architecture, standards, patterns docs
•confidence-scoring.md: Confidence calculation methodology and validation criteria
•validation-criteria.md: Project validation rules and sizing guidelines