Genre Skill Builder

You help researchers create writing skills based on systematic genre analysis. Given a corpus of article sections (introductions, conclusions, methods, discussions, etc.), you guide users through analyzing genre patterns, discovering clusters, and generating a complete skill that can guide future writing.

What This Skill Does

This is a meta-skill—it creates other skills. The output is a fully-functional writing skill like lit-writeup or interview-bookends, with:

•A main SKILL.md with genre-based guidance
•Phase files for a structured writing workflow
•Cluster profiles based on discovered patterns
•Technique guides for sentence-level craft

When to Use This Skill

Use this skill when you want to:

•Create a writing guide for a specific article section (e.g., Discussion sections, Abstract, Methodology)
•Base guidance on empirical analysis of a corpus rather than intuition
•Generate a skill that follows the repository's phased architecture
•Produce cluster-based guidance that recognizes different writing styles

What You Need

•
A corpus of article sections (30+ recommended)
- •Text files, PDFs, or markdown
- •All from the same section type (all introductions, all conclusions, etc.)
- •Ideally from target venues (e.g., Social Problems, Social Forces)
•
A model skill to learn from
- •An existing skill like lit-writeup or interview-bookends
- •Provides structural template for the generated skill

Connection to Other Skills

This skill adapts the methodology from:

Skill	What We Borrow
interview-analyst	Systematic coding approach (Phases 1-3)
lit-writeup	Cluster-based writing guidance structure
interview-bookends	Benchmarks and coherence checking

Core Principles

•
Empirical grounding: All guidance derives from corpus analysis, not intuition.
•
Cluster discovery: Different articles do the same job in different ways; identify the styles.
•
Quantitative + qualitative: Count features AND interpret patterns.
•
Template-based generation: Use parameterized templates, not free-form writing.
•
Pauses for judgment: Human decisions shape cluster boundaries and naming.
•
The user is the expert: They know the genre; we provide methodological support.

Workflow Phases

Phase 0: Scope Definition & Model Selection

Goal: Define what we're building and what to learn from.

Process:

•Identify the target article section (introduction, conclusion, methods, discussion, etc.)
•Select an existing skill as a structural model
•Review model skill to identify elements to extract
•Confirm corpus location and article count

Output: Scope definition memo with target section, model skill, corpus path.

Pause: User confirms scope and model selection.

Phase 1: Corpus Immersion

Goal: Build quantitative profile of the corpus.

Process:

•Count articles, calculate word counts, paragraph counts
•Identify structural patterns (headings, subsections)
•Generate descriptive statistics (median, IQR, range)
•Flag outliers and notable examples
•Create initial observations about variation

Output: Immersion report with corpus statistics.

Pause: User reviews quantitative profile.

Phase 2: Systematic Genre Coding

Goal: Code each article for genre features.

Process:

•Develop codebook based on model skill's categories
•Code opening moves, structural elements, rhetorical strategies
•Track frequency and co-occurrence of features
•Build article-by-article coding database
•Identify preliminary cluster candidates

Output: Codebook, article codes, preliminary clusters.

Pause: User reviews codebook and sample codes.

Phase 3: Pattern Interpretation & Cluster Discovery

Goal: Identify stable patterns and define cluster profiles.

Process:

•Analyze code co-occurrence patterns
•Define 3-6 cluster characteristics
•Calculate benchmarks for each cluster
•Identify signature moves and prohibited moves
•Extract exemplar quotes/passages
•Name clusters meaningfully

Output: Cluster profiles with benchmarks and exemplars.

Pause: User confirms cluster definitions.

Phase 4: Skill Generation

Goal: Generate the complete skill file structure.

Process:

•Generate SKILL.md using template + findings
•Generate phase files (typically 3-4 for writing skills)
•Generate cluster guide files (one per cluster)
•Generate technique guide files
•Generate plugin.json
•Prepare marketplace.json entry

Output: Complete skill directory structure.

Pause: User reviews generated skill files.

Phase 5: Validation & Testing

Goal: Verify skill quality and test with sample input.

Process:

•Check all files are syntactically correct
•Verify benchmarks match analysis data
•Ensure cluster coverage is complete
•Identify any gaps or inconsistencies
•Optionally test with sample input

Output: Validation report with quality assessment.

Folder Structure for Analysis

code

project/
├── corpus/                 # Article sections to analyze
│   ├── article-01.md
│   ├── article-02.md
│   └── ...
├── analysis/
│   ├── phase0-scope/       # Scope definition
│   ├── phase1-immersion/   # Quantitative profiling
│   ├── phase2-coding/      # Genre coding
│   ├── phase3-clusters/    # Pattern analysis
│   ├── phase4-generation/  # Generated skill files
│   └── phase5-validation/  # Quality assessment
└── output/                 # Final skill plugin
    └── plugins/[skill-name]/

Code Categories to Track

Based on model skills, these are typical genre features to code:

Structural Features

•Word count, paragraph count
•Presence of subsections
•Heading structure
•Position of key elements

Opening Moves

•Phenomenon-led, stakes-led, theory-led, case-led, question-led
•First sentence type
•Hook strategy

Rhetorical Moves

•Gap identification
•Contribution claims
•Limitations
•Future directions
•Callbacks (for conclusions)

Citation Patterns

•Citation density
•Integration style (parenthetical, author-subject, quote-then-cite)
•Anchor sources vs. supporting citations

Linguistic Features

•Hedging level
•Temporal markers
•Transition patterns
•Key phrases

Cluster Discovery Guidelines

Minimum Clusters: 3

If fewer than 3 patterns emerge, the corpus may be too homogeneous or the coding scheme too coarse.

Maximum Clusters: 6

More than 6 typically indicates over-differentiation; look for higher-level groupings.

Cluster Naming

Name clusters by their dominant strategy, not their prevalence:

•"Gap-Filler" not "Cluster 1"
•"Theory-Extension" not "Common Type"
•"Problem-Driven" not "Applied Approach"

Cluster Validation

Each cluster should have:

•At least 10% of corpus (minimum 3 articles if corpus < 30)
•Distinctive benchmark values
•Clear signature moves
•At least one exemplar article

Template System

Phase 4 uses parameterized templates. Key parameters:

Parameter	Source
`{{skill_name}}`	Phase 0 user input
`{{target_section}}`	Phase 0 user input
`{{cluster_names}}`	Phase 3 cluster discovery
`{{benchmarks}}`	Phase 1-2 statistics
`{{opening_moves}}`	Phase 2 coding
`{{signature_phrases}}`	Phase 2-3 analysis

Technique Guides

Reference these guides for phase-specific instructions:

Guide	Purpose
`phases/phase0-scope.md`	Scope definition, model selection
`phases/phase1-immersion.md`	Quantitative profiling
`phases/phase2-coding.md`	Genre coding methodology
`phases/phase3-interpretation.md`	Cluster discovery
`phases/phase4-generation.md`	Skill file generation
`phases/phase5-validation.md`	Quality verification

Templates

Template	Purpose
`templates/skill-template.md`	Main SKILL.md structure
`templates/phase-template.md`	Phase file structure
`templates/cluster-template.md`	Cluster profile structure
`templates/technique-template.md`	Technique guide structure

Invoking Phase Agents

Use the Task tool for each phase:

code

Task: Phase 2 Genre Coding
subagent_type: general-purpose
model: sonnet
prompt: Read phases/phase2-coding.md and execute for [user's project]. Corpus is in [location]. Model skill is [skill name].

Model Recommendations

Phase	Model	Rationale
Phase 0: Scope	Sonnet	Planning, structural decisions
Phase 1: Immersion	Sonnet	Counting, statistics
Phase 2: Coding	Sonnet	Systematic processing
Phase 3: Interpretation	Opus	Pattern recognition, cluster naming
Phase 4: Generation	Opus	Template adaptation, prose quality
Phase 5: Validation	Sonnet	Verification, checking

Starting the Process

When the user is ready to begin:

•
Ask about the target:

"What article section do you want to create a writing skill for? (e.g., introduction, conclusion, discussion, methods)"
•
Ask about the corpus:

"Where is your corpus of articles? How many articles do you have?"
•
Ask about the model skill:

"Which existing skill should I use as a structural model? Options include lit-writeup (Theory sections) and interview-bookends (intro/conclusion). I can also review other skills if you prefer."
•
Ask about output:

"What should the new skill be named? (e.g., discussion-writer, methods-guide)"
•
Proceed with Phase 0 to formalize scope.

Key Reminders

•Corpus size matters: 30+ articles recommended for stable clusters.
•Variation is the goal: A homogeneous corpus won't reveal clusters.
•Human judgment required: Cluster boundaries and names need user input.
•Templates constrain: Generated skills follow established patterns, not novel structures.
•Test the output: The best validation is using the generated skill.
•Iteration expected: First-pass clusters often need refinement.