Orchestrator Directives Skill

Use this skill for delegation patterns and decision frameworks in orchestrator mode.

Trigger keywords: orchestrator, delegation, subagent, task coordination, parallel execution, cost-first, spawner

Quick Start - What is Orchestration?

Delegate tactical work to specialized subagents while you focus on strategic decisions. Save Claude Code context (expensive) by using FREE/CHEAP AIs for appropriate tasks.

Basic pattern:

python

Task(
    subagent_type="gemini",  # FREE - use for exploration
    description="Find auth patterns",
    prompt="Search codebase for authentication patterns..."
)

When to use: ALWAYS use for complex tasks requiring research, code generation, git operations, or any work that could fail and require retries.

For complete guidance: See sections below or run /multi-ai-orchestration for model selection details.

CRITICAL: Cost-First Delegation (IMPERATIVE)

Claude Code is EXPENSIVE. You MUST delegate to FREE/CHEAP AIs first.

<details> <summary>Cost Comparison & Pre-Delegation Checklist</summary>

PRE-DELEGATION CHECKLIST (MUST EXECUTE BEFORE EVERY TASK())

Ask these questions IN ORDER:

•
Can Gemini do this? → Exploration, research, batch ops, file analysis
- •YES = MUST use gemini spawner (FREE - 2M tokens/min)
•
Is this code work? → Implementation, fixes, tests, refactoring
- •YES = MUST use codex spawner (70% cheaper than Claude)
•
Is this git/GitHub? → Commits, PRs, issues, branches
- •YES = MUST use copilot spawner (60% cheaper, GitHub-native)
•
Does this need deep reasoning? → Architecture, complex planning
- •YES = Use Claude Opus (expensive, but strategically needed)
•
Is this coordination? → Multi-agent work
- •YES = Use Claude Sonnet (mid-tier)
•
ONLY if above fail → Haiku (fallback)

Cost Comparison Examples

Task	WRONG (Cost)	CORRECT (Cost)	Savings
Search 100 files	Task() ($15-25)	Gemini spawner (FREE)	100%
Generate code	Task() ($10)	Codex spawner ($3)	70%
Git commit	Task() ($5)	Copilot spawner ($2)	60%
Strategic decision	Direct task ($20)	Claude Opus ($50)	Must pay for quality

WRONG vs CORRECT Examples

code

WRONG (wastes Claude quota):
- Code implementation → Task(haiku)               # USE Codex spawner
- Git commits → Task(haiku)                       # USE Copilot spawner
- File search → Task(haiku)                       # USE Gemini spawner (FREE!)
- Research → Task(haiku)                          # USE Gemini spawner (FREE!)

CORRECT (cost-optimized):
- Code implementation → Codex spawner             # Cheap, sandboxed
- Git commits → Copilot spawner                   # Cheap, GitHub-native
- File search → Gemini spawner                    # FREE!
- Research → Gemini spawner                       # FREE!
- Strategic decisions → Claude Opus               # Expensive, but needed
- Haiku → FALLBACK ONLY                           # When spawners fail

</details>

Core Concepts

<details> <summary>Orchestrator vs Executor Roles</summary>

Orchestrator (You):

•Makes strategic decisions
•Delegates tactical work
•Tracks progress with SDK
•Coordinates parallel subagents
•Only executes: Task(), AskUserQuestion(), TodoWrite(), SDK operations

Executor (Subagent):

•Handles tactical implementation
•Researches specific problems
•Fixes issues with retries
•Reports findings back
•Consumes resources independently (saves your context)

Why separation matters:

•Context preservation (MUST prevent failures from compounding in your context)
•Parallel efficiency (MUST run multiple subagents simultaneously)
•Cost optimization (ALWAYS use cheaper subagents than Claude Code)
•Error isolation (MUST keep failures in subagent context)

</details> <details> <summary>Why Delegation Matters: Context Cost Model</summary>

What looks like "one bash call" becomes many:

•Initial command fails → need to retry
•Test hooks break → need to fix code → retry
•Push conflicts → need to pull/merge → retry
•Each retry consumes tokens

Context cost comparison:

code

Direct execution (fails):
  bash call 1 → fails
  bash call 2 → fails
  bash call 3 → fix code
  bash call 4 → bash call 1 retry
  bash call 5 → bash call 2 retry
  = 5+ tool calls, context consumed

Delegation (cascades isolated):
  Task(subagent handles all retries) → 1 tool call
  Read result → 1 tool call
  = 2 tool calls, clean context

Token savings:

•Each failed retry: 2,000-5,000 tokens wasted
•Cascading failures: 10,000+ tokens wasted
•Subagent isolation: None of that pollution in orchestrator context

</details> <details> <summary>Decision Framework: When to Delegate vs Execute</summary>

Ask yourself these questions:

•
Will this likely be ONE tool call?
- •Uncertain → DELEGATE
- •Certain → MAY do directly (single file read, quick check)
•
Does this require error handling?
- •If yes → DELEGATE (subagent handles retries)
•
Could this cascade into multiple operations?
- •If yes → DELEGATE
•
Is this strategic or tactical?
- •Strategic (decisions) → Do directly
- •Tactical (execution) → DELEGATE

Rule of thumb: When in doubt, ALWAYS DELEGATE. Cascading failures are expensive.

</details> <details> <summary>Three Allowed Direct Operations</summary>

Only these can be executed directly by orchestrator:

•
Task() - Delegation itself
- •Use spawner subagent types when possible
- •Example: Task(subagent_type="htmlgraph:gemini-spawner", ...)
•
AskUserQuestion() - Clarifying requirements
- •Get user input before delegating
- •Example: AskUserQuestion("Should we use Redis or PostgreSQL?")
•
TodoWrite() - Tracking work items
- •Create/update todo lists
- •Example: TodoWrite(todos=[...])

SDK operations (create features, spikes, bugs):

•sdk.features.create()
•sdk.spikes.create()
•sdk.bugs.create()

Everything else MUST be delegated.

</details>

Model Selection & Spawner Guide

<details> <summary>Spawner Selection Decision Tree</summary>

Decision tree (check each in order):

•
Is this exploration/research/analysis?
- •Files search: YES → Gemini spawner (FREE)
- •Pattern analysis: YES → Gemini spawner (FREE)
- •Documentation reading: YES → Gemini spawner (FREE)
- •Learning unfamiliar system: YES → Gemini spawner (FREE)
•
Is this code implementation/testing?
- •Generate code: YES → Codex spawner (70% cheaper)
- •Fix bugs: YES → Codex spawner
- •Write tests: YES → Codex spawner
- •Refactor code: YES → Codex spawner
•
Is this git/GitHub operation?
- •Commit changes: YES → Copilot spawner (60% cheaper, GitHub-native)
- •Create PR: YES → Copilot spawner
- •Manage branches: YES → Copilot spawner
- •Review code: YES → Copilot spawner
•
Does this need deep reasoning?
- •Architecture decisions: YES → Claude Opus (expensive, but needed)
- •Complex design: YES → Claude Opus
- •Strategic planning: YES → Claude Opus
•
Is this multi-agent coordination?
- •Coordinate multiple spawners: YES → Claude Sonnet (mid-tier)
- •Complex workflows: YES → Claude Sonnet
•
All else fails → Task() with Haiku (fallback)

Spawner Subagent Types:

•gemini - FREE, 2M tokens/min, exploration & research
•codex - Cheap code specialist, implementation & testing
•copilot - Cheap git specialist, GitHub integration
•haiku - Generic Claude Haiku (use as fallback or when spawners fail)

</details> <details> <summary>Spawner Details & Configuration</summary>

Gemini Spawner (FREE - Exploration)

python

Task(
    subagent_type="gemini",
    description="Analyze authentication patterns",
    prompt="""
    Analyze codebase for:
    - All authentication patterns
    - OAuth implementations
    - Session management
    - JWT usage
    """
)

Best for:

•File searching (FREE!)
•Pattern analysis (FREE!)
•Documentation research (FREE!)
•Understanding unfamiliar systems (FREE!)

Codex Spawner (Cheap - Code)

python

Task(
    subagent_type="codex",
    description="Implement OAuth middleware",
    prompt="""
    Implement OAuth authentication:
    - Sandbox mode: workspace-write
    - Add JWT token generation
    - Include error handling
    - Write unit tests
    """
)

Best for:

•Code generation
•Bug fixes
•Test writing
•Refactoring
•Sandboxed execution

Copilot Spawner (Cheap - Git)

python

Task(
    subagent_type="copilot",
    description="Commit and create PR",
    prompt="""
    Commit changes and create PR:
    - Message: "feat: add OAuth authentication"
    - Files: src/auth/*.py, tests/test_auth.py
    - Create PR with description
    """
)

Best for:

•Git commits (60% cheaper than Task)
•PR creation
•Branch management
•GitHub integration
•Resolving conflicts

Task() with Sonnet/Opus (Strategic)

python

Task(
    prompt="Design authentication architecture...",
    subagent_type="sonnet"  # or "opus" for deep reasoning
)

Sonnet (Mid-tier):

•Coordinate complex workflows
•Multi-agent orchestration
•Fallback when spawners fail

Opus (Expensive):

•Deep reasoning
•Architecture decisions
•Strategic planning
•When quality matters more than cost

</details>

Delegation Patterns & Examples

<details> <summary>Basic Delegation Pattern</summary>

Simple exploration:

python

Task(
    subagent_type="gemini",
    description="Find all auth patterns",
    prompt="Search codebase for authentication patterns and summarize findings"
)

Code implementation:

python

Task(
    subagent_type="codex",
    description="Implement OAuth endpoint",
    prompt="Implement OAuth authentication endpoint with JWT support"
)

Git operations:

python

Task(
    subagent_type="copilot",
    description="Commit changes",
    prompt="Commit changes with message: 'feat: add OAuth authentication'"
)

</details> <details> <summary>Parallel Delegation (Multiple Independent Tasks)</summary>

Pattern: Spawn all at once, retrieve results independently

python

# Create all tasks in parallel (single message)
Task(
    subagent_type="gemini",
    description="Research auth patterns",
    prompt="Analyze existing authentication patterns..."
)

Task(
    subagent_type="codex",
    description="Implement OAuth",
    prompt="Implement OAuth flow..."
)

Task(
    subagent_type="copilot",
    description="Create PR",
    prompt="Commit and create pull request..."
)

# All run in parallel, optimized for cost:
# - Gemini: FREE
# - Codex: $ (cheap)
# - Copilot: $ (cheap)

Benefits:

•3 tasks in parallel: time = max(T1, T2, T3) instead of T1+T2+T3
•Cost optimization: Uses cheapest model for each task
•Independent results: Each task tracked separately

</details> <details> <summary>Sequential Delegation with Dependencies</summary>

Pattern: Chain dependent tasks in sequence

python

# 1. Research existing patterns
Task(
    subagent_type="gemini",
    description="Research OAuth patterns",
    prompt="Find all OAuth implementations in codebase..."
)

# 2. Wait for research, then implement
# (In next message after reading result)
research_findings = "..."  # Read from previous task result

Task(
    subagent_type="codex",
    description="Implement OAuth based on research",
    prompt=f"""
    Implement OAuth using discovered patterns:
    {research_findings}
    """
)

# 3. Wait for implementation, then commit
Task(
    subagent_type="copilot",
    description="Commit implementation",
    prompt="Commit OAuth implementation..."
)

When to use: When later tasks depend on earlier results

</details> <details> <summary>HtmlGraph Result Retrieval</summary>

Subagents report findings automatically:

When a Task() completes, findings are stored in HtmlGraph:

python

# SDK can retrieve results
from htmlgraph import SDK
sdk = SDK(agent='orchestrator')

# Get recent spike (subagent's findings)
spike = sdk.spikes.list(limit=1)[0]
findings = spike.get_findings()

Pattern: Read findings after Task completes

python

# 1. Delegate exploration
Task(
    subagent_type="gemini",
    description="Analyze auth patterns",
    prompt="Find all authentication patterns..."
)

# 2. Read findings from HtmlGraph
sdk = SDK(agent='orchestrator')
recent_spike = sdk.spikes.list(limit=1)[0]
findings = recent_spike.get_findings()

# 3. Use findings in next delegation
Task(
    subagent_type="codex",
    description="Implement based on findings",
    prompt=f"Implement authentication:\n{findings}"
)

</details> <details> <summary>Error Handling & Retries</summary>

Let subagents handle retries:

python

# WRONG - Don't retry directly as orchestrator
bash_result = Bash(command="git commit -m 'feat: new'")
if failed:
    # Retry directly (context pollution)
    Bash(command="git pull && git commit")  # More context used

# CORRECT - Subagent handles retries
Task(
    subagent_type="copilot",
    description="Commit changes with retry",
    prompt="""
    Commit changes:
    Message: "feat: new feature"

    If commit fails:
    1. Pull latest changes
    2. Resolve conflicts if any
    3. Retry commit
    4. Handle pre-commit hooks

    Report final status: success or failure
    """
)

Benefits:

•Subagent context handles retries (not your context)
•Cleaner error reporting
•Automatic recovery attempts
•You get clean success/failure

</details>

Advanced: Post-Compact Persistence

<details> <summary>Orchestrator Activation After Compact</summary>

How it works:

•Before compact, SDK sets environment variable: CLAUDE_ORCHESTRATOR_ACTIVE=true
•SessionStart hook detects post-compact state
•Orchestrator Directives Skill auto-activates
•This skill section appears automatically (first time post-compact)

Why: Preserve orchestration discipline after context compact

What you see:

•Skill automatically activates (no manual invocation needed)
•Quick start section visible by default
•Expand detailed sections as needed
•Full guidance available without re-reading docs

To manually trigger:

code

/orchestrator-directives

Environment variable:

bash

CLAUDE_ORCHESTRATOR_ACTIVE=true  # Set by SDK

</details> <details> <summary>Session Continuity Across Compacts</summary>

Features preserved across compact:

•Work items in HtmlGraph
•Feature/spike tracking
•Delegation patterns
•Model selection guidance
•This skill's guidance

What's lost:

•Your context (that's why compact happens)
•Intermediate tool outputs
•Local variables

Re-activation pattern:

code

Before compact:
- Work on features, track in HtmlGraph
- Delegate with clear prompts
- Use SDK to save progress

After compact:
- Orchestrator Skill auto-activates
- Re-read recent spikes for context
- Continue delegations
- Use Task IDs for parallel coordination

</details>

Core Philosophy

<details> <summary>Core Principles Summary</summary>

Principle 1: Delegation > Direct Execution

•Cascading failures consume exponentially more context than structured delegation
•One failed bash call becomes 3-5 calls with retries
•Delegation isolates failures to subagent context

Principle 2: Cost-First > Capability-First

•Use FREE/cheap AIs (Gemini, Codex, Copilot) before expensive Claude Code
•Gemini: FREE (exploration)
•Codex: 70% cheaper (code)
•Copilot: 60% cheaper (git)
•Claude: Expensive (strategic only)

Principle 3: You Don't Know the Outcome

•What looks like "one tool call" often becomes many
•Unexpected failures, conflicts, retries consume context
•Delegation removes unpredictability from orchestrator context

Principle 4: Parallel > Sequential

•Multiple subagents can work simultaneously
•Much faster than sequential execution
•Orchestrator stays available for decisions

Principle 5: Track Everything

•Use HtmlGraph SDK to track delegations
•Features, spikes, bugs created for all work
•Clear record of who did what

</details>

Core Philosophy

Delegation > Direct Execution. Cascading failures consume exponentially more context than structured delegation.

Cost-First > Capability-First. Use FREE/cheap AIs before expensive Claude models.

Quick Reference Table

<details> <summary>Operation Type → Correct Delegation</summary>

Operation	MUST Use	Cost	Fallback
Search files	Gemini spawner	FREE	Haiku
Pattern analysis	Gemini spawner	FREE	Haiku
Documentation research	Gemini spawner	FREE	Haiku
Code generation	Codex spawner	$ (70% off)	Sonnet
Bug fixes	Codex spawner	$ (70% off)	Haiku
Write tests	Codex spawner	$ (70% off)	Haiku
Git commits	Copilot spawner	$ (60% off)	Haiku
Create PRs	Copilot spawner	$ (60% off)	Haiku
Architecture	Claude Opus	$$$$	Sonnet
Strategic decisions	Claude Opus	$$$$	Task()

Key: FREE = No cost | $ = Cheap | $$$$ = Expensive (but necessary)

</details>

Related Skills

•/multi-ai-orchestration - Comprehensive model selection guide with detailed decision matrix
•/code-quality - Quality gates and pre-commit workflows
•/strategic-planning - HtmlGraph analytics for smart prioritization

Reference Documentation

•Complete Rules: See orchestration.md
•Advanced Patterns: See reference.md
•HtmlGraph SDK: from htmlgraph import SDK

Quick Summary

Cost-First Orchestration:

•Gemini (FREE) → exploration, research, analysis
•Codex (70% off) → code implementation, fixes, tests
•Copilot (60% off) → git operations, PRs
•Claude Opus → deep reasoning, strategy only

Orchestrator Rule: Only execute: Task(), AskUserQuestion(), TodoWrite(), SDK operations

Everything else → Delegate to appropriate spawner

When in doubt → DELEGATE