Parallel Exploration (Background Scout Fan-Out)

Overview

When you need to answer "where/how does X work?" across multiple domains (codebase, tests, docs, OSS), investigating sequentially wastes time. Each investigation is independent and can happen in parallel.

Core principle: Decompose into independent sub-questions, spawn one hive_hive_background_task per sub-question, collect results asynchronously.

Safe in Planning mode: This is read-only exploration. It is OK to use during exploratory research even when there is no feature, no plan, and no approved tasks.

This skill is for read-only research. For parallel implementation work, use hive_skill("dispatching-parallel-agents") with hive_exec_start.

When to Use

Default to this skill when: Use when:

•Investigation spans multiple domains (code + tests + docs)
•User asks 2+ questions across different domains (e.g., code + tests, code + docs/OSS, code + config/runtime)
•Questions are independent (answer to A doesn't affect B)
•User asks 3+ independent questions (often as a numbered list or separate bullets)
•No edits needed (read-only exploration)
•User asks for an explorationthat likely spans multiple files/packages
•The work is read-only and the questions can be investigated independently

Only skip this skill when:

•Investigation requires shared state or context between questions
•It's a single focused question that is genuinely answerable with one quick grep + one file read
•Questions are dependent (answer A materially changes what to ask for B)
•Work involves file edits (use Hive tasks / Forager instead)

Important: Do not treat "this is exploratory" as a reason to avoid delegation. This skill is specifically for exploratory research when fan-out makes it faster and cleaner.

The Pattern

1. Decompose Into Independent Questions

Split your investigation into 2-4 independent sub-questions. Good decomposition:

Domain	Question Example
Codebase	"Where is X implemented? What files define it?"
Tests	"How is X tested? What test patterns exist?"
Docs/OSS	"How do other projects implement X? What's the recommended pattern?"
Config	"How is X configured? What environment variables affect it?"

Bad decomposition (dependent questions):

•"What is X?" then "How is X used?" (second depends on first)
•"Find the bug" then "Fix the bug" (not read-only)

2. Spawn Background Tasks (Fan-Out)

Launch all tasks before waiting for any results:

typescript

// Fan-out: spawn all tasks first
hive_background_task({
  agent: "scout-researcher",
  description: "Find hive_background_task implementation",
  prompt: `Where is hive_background_task implemented and registered?
    - Find the tool definition
    - Find the plugin registration
    - Return file paths with line numbers`,
  sync: false
})

hive_background_task({
  agent: "scout-researcher",
  description: "Analyze background task concurrency",
  prompt: `How does background task concurrency/queueing work?
    - Find the manager/scheduler code
    - Document the concurrency model
    - Return file paths with evidence`,
  sync: false
})

hive_background_task({
  agent: "scout-researcher",
  description: "Find parent notification mechanism",
  prompt: `How does parent notification work for background tasks?
    - Where is the notification built?
    - How is it sent to the parent session?
    - Return file paths with evidence`,
  sync: false
})

Key points:

•Use agent: "scout-researcher" for read-only exploration
•Use sync: false to return immediately (non-blocking)
•Give each task a clear, focused description
•Make prompts specific about what evidence to return

3. Continue Working (Optional)

While tasks run, you can:

•Work on other aspects of the problem
•Prepare synthesis structure
•Start drafting based on what you already know

You'll receive a <system-reminder> notification when each task completes.

4. Collect Results

When notified of completion, retrieve results:

typescript

// Get output from completed task
hive_background_output({
  task_id: "task-abc123",
  block: false  // Don't wait, task already done
})

For incremental output (long-running tasks):

typescript

// First call - get initial output
hive_background_output({
  task_id: "task-abc123",
  block: true,      // Wait for output
  timeout: 30000    // 30 second timeout
})
// Returns: { output: "...", cursor: "5" }

// Later call - get new output since cursor
hive_background_output({
  task_id: "task-abc123",
  cursor: "5",      // Resume from message 5
  block: true
})

5. Synthesize Findings

Combine results from all tasks:

•Cross-reference findings (file X mentioned by tasks A and B)
•Identify gaps (task C found nothing, need different approach)
•Build coherent answer from parallel evidence

6. Cleanup (If Needed)

Cancel tasks that are no longer needed:

typescript

// Cancel specific task
hive_background_cancel({ task_id: "task-abc123" })

// Cancel all your background tasks
hive_background_cancel({ all: true })

Prompt Templates

Codebase Slice

code

Investigate [TOPIC] in the codebase:
- Where is [X] defined/implemented?
- What files contain [X]?
- How does [X] interact with [Y]?

Return:
- File paths with line numbers
- Brief code snippets as evidence
- Key patterns observed

Tests Slice

code

Investigate how [TOPIC] is tested:
- What test files cover [X]?
- What testing patterns are used?
- What edge cases are tested?

Return:
- Test file paths
- Example test patterns
- Coverage gaps if obvious

Docs/OSS Slice

code

Research [TOPIC] in external sources:
- How do other projects implement [X]?
- What does the official documentation say?
- What are common patterns/anti-patterns?

Return:
- Links to relevant docs/repos
- Key recommendations
- Patterns that apply to our codebase

Real Example

Investigation: "How does the background task system work?"

Decomposition:

•Implementation: Where is hive_background_task tool defined?
•Concurrency: How does task scheduling/queueing work?
•Notifications: How does parent session get notified?

Fan-out:

typescript

// Task 1: Implementation
hive_background_task({
  agent: "scout-researcher",
  description: "Find hive_background_task implementation",
  prompt: "Where is hive_background_task implemented? Find tool definition and registration.",
  sync: false
})

// Task 2: Concurrency
hive_background_task({
  agent: "scout-researcher",
  description: "Analyze concurrency model",
  prompt: "How does background task concurrency work? Find the manager/scheduler.",
  sync: false
})

// Task 3: Notifications
hive_background_task({
  agent: "scout-researcher",
  description: "Find notification mechanism",
  prompt: "How are parent sessions notified of task completion?",
  sync: false
})

Results:

•Task 1: Found background-tools.ts (tool definition), index.ts (registration)
•Task 2: Found manager.ts with concurrency=3 default, queue-based scheduling
•Task 3: Found session.prompt() call in manager for parent notification

Synthesis: Complete picture of background task lifecycle in ~1/3 the time of sequential investigation.

Common Mistakes

Spawning sequentially (defeats the purpose):

typescript

// BAD: Wait for each before spawning next
const result1 = await hive_background_task({ ..., sync: true })
const result2 = await hive_background_task({ ..., sync: true })

typescript

// GOOD: Spawn all, then collect
hive_background_task({ ..., sync: false })  // Returns immediately
hive_background_task({ ..., sync: false })  // Returns immediately
hive_background_task({ ..., sync: false })  // Returns immediately
// ... later, collect results with hive_background_output

Too many tasks (diminishing returns):

•2-4 tasks: Good parallelization
•5+ tasks: Overhead exceeds benefit, harder to synthesize

Dependent questions:

•Don't spawn task B if it needs task A's answer
•Either make them independent or run sequentially

Using for edits:

•Scout is read-only; use Forager for implementation
•This skill is for exploration, not execution

Key Benefits

•Speed - 3 investigations in time of 1
•Focus - Each Scout has narrow scope
•Independence - No interference between tasks
•Flexibility - Cancel unneeded tasks, add new ones

Verification

After using this pattern, verify:

• All tasks spawned before collecting any results (true fan-out)
• Received notifications for completed tasks
• Successfully retrieved output with hive_background_output
• Synthesized findings into coherent answer