AGENTS.md Authoring Guide (Mark)

Goal: Write effective, high-signal-to-noise AGENTS.md files that outperform skills-based approaches (100% vs 79% pass rate based on Vercel research).

When to Use This Skill

•Creating a new AGENTS.md file
•Updating an existing AGENTS.md
•Optimizing agent context for better performance
•Converting from skills-based approach to AGENTS.md
•Setting up project documentation for AI agents
•Monorepo with nested AGENTS.md files

When NOT to Use This Skill

•Writing end-user documentation (use writing-docs-majo)
•Creating a new skill (use skill-authoring-majo)
•Small project with obvious conventions (AGENTS.md may be overkill)
•Documentation for non-agent consumption

Process

•Check existing AGENTS.md - Read current file if it exists
•Measure current token count - Target root file ≤ 800 tokens
•
Apply 6-Core-Areas Framework:
- •Commands (~80 tokens) - Exact copy-paste commands
- •Testing (~50 tokens) - Framework, location, coverage, run command
- •Project Structure (~50 tokens) - Strategic hints only
- •Code Style (~75 tokens) - One real example with positive & negative
- •Git Workflow (~40 tokens) - Commit format, branch naming, PR requirements
- •Boundaries (~50 tokens) - Always/Ask/Never three-tier system
•Use real code examples - From the actual project, not templates
•Apply progressive disclosure - Link to detailed docs, don't inline
•Add three-tier boundaries - ALWAYS/ASK FIRST/NEVER format
•Verify token count - Ensure root ≤ 800 tokens
•Test with real tasks - Run agents on actual work, observe mistakes
•Iterate - Update AGENTS.md after each agent mistake
•Update AGENTS.md - Document that AGENTS.md was modified

Constraints

•ALWAYS keep root AGENTS.md ≤ 800 tokens - Maximize signal-to-noise
•ALWAYS minimize root file - Strategic structure over exhaustive docs
•ALWAYS use six-core-areas framework - Commands, Testing, Structure, Style, Git, Boundaries
•ALWAYS use three-tier boundaries (Always/Ask/Never) - More effective than flat lists
•NEVER include file structure documentation - Agents infer this; it drifts
•NEVER duplicate README/docs - Link instead
•NEVER use vague guidance - "Be careful" → "Never do X without..."
•UPDATE immediately after agent mistakes - Track in git

Write effective AGENTS.md files that outperform skills-based approaches.

Core Principle

Minimize root file; maximize signal-to-noise ratio through strategic structure, not exhaustive documentation.

AGENTS.md is loaded on every agent request. Keep it concise but information-dense.

AGENTS.md vs Skills: When to Use Each

Approach	Use When	Performance
AGENTS.md	Project-specific knowledge, always-needed context, version-matched docs	100% pass rate
Skills	Cross-project patterns, progressive disclosure, large reference material	79% pass rate

Key Finding (Vercel Research): AGENTS.md achieved 100% pass rate vs 79% with skills, even with explicit skill invocation instructions.

Why AGENTS.md Wins:

•No decision point - always loaded
•Consistent availability - in system prompt every turn
•No ordering issues - no "read docs first vs explore project first" dilemma

Use AGENTS.md for:

•Domain concepts specific to your project
•Non-obvious conventions agents can't infer
•Executable commands with exact flags
•Version-matched framework documentation
•Project boundaries and gotchas

Use Skills for (see skill-authoring-majo):

•Cross-project patterns
•Large content requiring progressive disclosure
•Workflows that don't fit in AGENTS.md
•When context bloat is a concern

Six-Core-Areas Framework

Structure your AGENTS.md around six key areas (300-800 tokens total):

1. Commands (~80 tokens)

Exact, copy-paste-ready commands with flags.

markdown

## Build Commands

```bash
# Build the package
pnpm build

# Development mode
pnpm dev

# Run tests
pnpm test

code


**Why**: Agents reference repeatedly; vague instructions create clarification loops.

### 2. Testing (~50 tokens)

Framework, location, coverage minimum, run command.

```markdown
## Testing

Framework: Jest
Location: tests/
Coverage: 80% minimum
Run: pnpm test

Don't include philosophy or lengthy examples.

3. Project Structure (~50 tokens)

Strategic hints, not exhaustive paths.

markdown

## Project Structure

src/ — Application code (READ)
tests/ — Unit/integration tests
docs/ — Markdown (WRITE)

Agents discover files well; structure drifts. Skip detailed file listings.

4. Code Style (~75 tokens)

One real repo example showing BOTH positive and negative patterns.

markdown

## Code Style

```python
# GOOD: Use retry() for polling/waiting
from next_test_utils import retry
await retry(async () => {
    const text = await browser.elementByCss('p').text()
    expect(text).toBe('expected value')
})

# AVOID: Don't use setTimeout for waiting
await new Promise((resolve) => setTimeout(resolve, 1000))

code


One example > three paragraphs of description. Show the anti-pattern too.

### 5. Git Workflow (~40 tokens)

Commit format, branch naming, PR requirements.

```markdown
## Git Workflow

Commits: feat(scope): description (Conventional Commits)
Branches: feature/*, bugfix/*, docs/*
PRs: lint ✓, tests ✓, focused diff

6. Boundaries (~50 tokens)

Three-tier system: Always/Ask/Never (more effective than flat "don't" lists).

markdown

## Boundaries

ALWAYS: Run tests before commits; follow naming conventions
ASK FIRST: Before schema changes; before new dependencies; before CI/CD changes
NEVER: Commit secrets; edit node_modules; remove failing tests without approval

Why three tiers: Eliminates agent ambiguity better than flat lists.

Token Efficiency Techniques

Progressive Disclosure

Link to detailed docs instead of inlining:

markdown

Detailed patterns: [see docs/api.md §3.2]
Quick reference: Endpoints return { "data": {...}, "error": null }

Compression

Use pipe-delimited format for complex concepts. Vercel achieved 80% reduction (40KB → 8KB):

markdown

[Docs Index]|root: ./.docs
|01-getting-started:{installation.md,project-structure.md}
|02-api:{routing.md,caching.md}

Hierarchical Summarization

Extended TOC at top; full sections below. Keeps frequent tasks in focus.

What to Include vs Exclude

INCLUDE:

•Domain concepts specific to your project ("Organization" vs "Workspace")
•Non-obvious conventions agents can't infer ("Use composition over inheritance")
•Executable commands with exact flags
•Boundaries and gotchas from real experience ("This library has a memory leak in v3.2.1")
•Real code examples from your repo (not templates)

EXCLUDE:

•File structure documentation (agents infer this; it drifts)
•Duplicates of README/docs (link instead)
•Abstract best practices ("Write clean code")
•Lengthy prose (every section = 10-second scan)
•Vague guidance ("Be careful with X" → "Never do X without...")
•Generic advice that overlaps across projects

Example AGENTS.md Structure

Based on Vercel's Next.js AGENTS.md:

markdown

# Project Development Guide

## Quick Start

```bash
# Install dependencies
pnpm install

# Start development
pnpm dev

# Run tests
pnpm test

Project Structure

src/ — Application code tests/ — Test files docs/ — Documentation

Build Commands

bash

pnpm build      # Production build
pnpm dev        # Development server
pnpm test       # Run all tests
pnpm test:watch # Watch mode

Testing

Framework: Vitest Location: tests/ Coverage: 80% minimum

Writing Tests

typescript

// GOOD: Use retry() for polling
await retry(async () => {
    expect(await page.text()).toBe('done')
})

// AVOID: Don't use setTimeout
await new Promise(r => setTimeout(r, 1000))

Code Style

•TypeScript strict mode
•Prettier for formatting
•ESLint for linting

Git Workflow

Commits: feat(scope): description Branches: feature/, bugfix/ PRs: tests pass, lint clean

Boundaries

ALWAYS: Run tests before push; update AGENTS.md after agent mistakes ASK FIRST: Before adding dependencies; before architectural changes NEVER: Commit .env files; modify generated code directly

Common Issues

•Build fails after branch switch → Run pnpm build
•Tests timeout → Check if dev server is running
•Type errors → Run pnpm types to check

code


## Nested AGENTS.md (Monorepos)

For monorepos, use nested AGENTS.md files:

project/ ├── AGENTS.md # Global rules ├── packages/ │ ├── backend/ │ │ └── AGENTS.md # Backend-specific │ └── frontend/ │ └── AGENTS.md # Frontend-specific

code


Agents use the nearest AGENTS.md (like .gitignore). Keeps context tight and relevant.

## Iterative Refinement Process

1. **Start minimal**: Six-core-areas, ~400 tokens
2. **Run agents on real tasks**: Observe mistakes
3. **When agent misunderstands/fails**: Note it
4. **Add clarifying rule/example/boundary to AGENTS.md**
5. **Tell agent**: "I updated AGENTS.md. Given these new rules, refactor accordingly."
6. **Repeat**: Each iteration tightens spec, reduces future mistakes

**Example iterations**:
- Iteration 1: Agent uses old API → Add: "Use v2/users endpoint, not v1/users"
- Iteration 2: Agent skips validation → Add: "Always validate ID format"
- Iteration 3: Agent modifies tests when not asked → Add boundary: "Never modify tests unless explicitly asked"

## Anti-Patterns to Avoid

| Anti-Pattern | Problem | Fix |
|---|---|---|
| Vague rules | Agent asks clarifying questions (wasted tokens) | Be specific: "Never commit secrets" not "Be careful with secrets" |
| Outdated AGENTS.md | Agent ignores stale guidance or hallucinates | Update immediately after mistakes; track in git |
| Monolithic structure | Agent loads irrelevant context every task | Use nested AGENTS.md; progressive disclosure |
| Duplicate docs | Token waste; maintenance burden | Link instead of inlining |
| Abstract examples | Agents don't know how to apply guidance | Use real files: "copy DashForm.tsx" |
| Conflicting rules | Agent uncertain which rule applies | Use Always/Ask/Never tiers, not flat lists |

## Quick Checklist

- [ ] Root ≤ 800 tokens
- [ ] Six-core-areas framework present
- [ ] Three-tier boundaries (Always/Ask/Never)
- [ ] Real code examples (positive & negative)
- [ ] Links to external docs, not duplicates
- [ ] No file structure documentation
- [ ] No vague guidance
- [ ] Updated after every agent mistake
- [ ] Nested AGENTS.md for monorepos (if applicable)
- [ ] Progressive disclosure for large docs

## Testing Skills

- **Token count check**: Ensure root AGENTS.md ≤ 800 tokens (use wc or estimate)
- **Structure verification**: Confirm all 6 core areas are present
- **Boundary clarity**: Check Always/Ask/Never tiers are specific, not vague
- **Real examples test**: Verify code examples actually exist in the repo
- **Link validation**: Ensure linked docs exist and are accessible
- **Agent run test**: Perform a real task, verify agent follows AGENTS.md guidance
- **Iteration tracking**: Document each update and the mistake it fixed

## Integration with Other Skills

This skill extends `dev-standards-majo`. Always ensure `dev-standards-majo` is loaded for:
- AGENTS.md maintenance
- Universal code principles
- Documentation policies

Works alongside:
- `skill-authoring-majo` — For deciding when to use AGENTS.md vs create skills
- `task-planning-majo` — For planning AGENTS.md updates
- `git-majo` — For committing AGENTS.md changes
- `writing-docs-majo` — For writing documentation about AGENTS.md

## References

- Vercel Research: ["AGENTS.md outperforms skills in our agent evals"](https://vercel.com/blog/agents-md-outperforms-skills-in-our-agent-evals) (100% vs 79% pass rate)
- Next.js AGENTS.md: https://github.com/vercel/next.js/blob/canary/AGENTS.md
- agents.md specification: https://agents.md/