Review Task Orchestration
Overview
| Attribute | Details |
|---|---|
| Date | 2026-02-01 |
| Objective | Systematically complete architecture review tasks using parallel agents, structured tracking, and GitHub issue management |
| Outcome | ✅ Successfully completed 8/10 tasks (80%), filed 3 GitHub issues for remaining work, created comprehensive tracking documentation |
| Category | Tooling |
When to Use This Skill
Use this workflow when:
- •Implementing multiple tasks from a comprehensive review or audit
- •Tasks vary in complexity (some can be done immediately, others need planning)
- •You want to maximize parallel work while maintaining structured tracking
- •Some tasks are large enough to warrant GitHub issues rather than immediate implementation
- •You need to demonstrate incremental progress and publication readiness
Trigger phrases:
- •"Continue with the remaining tasks"
- •"Implement the architecture review plan"
- •"Complete the review tasks in parallel"
Verified Workflow
Phase 1: Initial Assessment & Quick Wins
- •
Review existing completed work:
bashgit log --oneline -10 git branch -a | grep review
- •
Identify task categories:
- •✅ Quick wins: Can be implemented immediately (1-2 hours each)
- •📋 Issue-worthy: Requires significant planning or loader changes
- •⏳ Blocked: Depends on other tasks
- •
Tackle quick wins first (in this session: P1-5, P1-6, P1-7, P1-8):
- •P1-5: Add
__all__exports (15 minutes) - •P1-6: Add min sample guard (30 minutes)
- •P1-7: Add 20 new tests (agent-assisted, 1 hour)
- •P1-8: Document non-computable metrics (agent-assisted, 1 hour)
- •P1-5: Add
Phase 2: Parallel Agent Execution
- •
Launch parallel agents for independent tasks:
python# Launch 2 agents in parallel for P1-7 and P1-8 - Agent a5b45c9: Add tests for 7 untested functions - Agent a0284dc: Document non-computable metrics
- •
Monitor agent progress:
bash# Agents run in background, notify on completion # Check status if needed: tail -f /tmp/claude-1000/-home-mvillmow-ProjectScylla/tasks/{agent_id}.output - •
Handle agent failures gracefully:
- •Agents may fail with errors but still produce useful output
- •Read agent output files even if status is "failed"
- •Extract completed work from agent transcripts
Phase 3: Complex Tasks → GitHub Issues
- •
Use Plan Mode for large tasks (P1-3 example):
pythonEnterPlanMode() # Launch exploration agents to understand: # - Current loader implementation # - DataFrame column structure # - Statistics functions available # - Test fixture patterns # Design implementation approach # Write comprehensive plan to plan file
- •
Convert plan to GitHub issue:
bashgh issue create \ --title "P1-3: Create Tier-Specific Metrics from Agent Result Data" \ --label "enhancement" \ --body "$(cat plan-summary.md)"
- •
File issues for remaining tasks:
- •Include implementation approach
- •List files to modify
- •Add verification steps
- •Link dependencies
- •Add priority labels (P0/P1/P2)
Phase 4: Structured Tracking
- •
Maintain comprehensive tracking document:
markdowndocs/dev/architecture-review-implementation.md ## Completed Tasks - P0-1: ✅ [details] - P1-1: ✅ [details] ## Pending Tasks - P1-3: 📋 [#314](github.com/...) - [summary] - P1-4: 📋 [#315](github.com/...) - [summary] ## Summary Statistics - Completed: 8/10 (80%) - P0 blockers: 100% resolved - P1 improvements: 75% complete
- •
Update task metadata with GitHub issue links:
pythonTaskUpdate(taskId="15", status="pending", metadata={"github_issue": "314"})
Phase 5: Create Pull Request
- •
Commit all completed work:
bashgit add scylla/ tests/ docs/ git commit -m "feat(analysis): Complete P1-5, P1-6, P1-7, P1-8"
- •
Create comprehensive PR:
bashgh pr create \ --title "feat(analysis): Complete P1-5, P1-6, P1-7, P1-8" \ --body "..." \ --label "enhancement" gh pr merge --auto --rebase
- •
Link PR to completed tasks and issues:
- •Mention issue numbers in PR body
- •Add "Part of #XYZ" references
- •Link to tracking document
Failed Attempts & Lessons Learned
❌ Exploration Agents Failed with "classifyHandoffIfNeeded" Error
What happened:
- •Both parallel exploration agents (ae3e20f, a88b03e) failed with error: "classifyHandoffIfNeeded is not defined"
- •Despite failure status, agents produced useful output before crashing
What worked instead:
- •Read agent output files directly even when status is "failed"
- •Extract the comprehensive reports from agent transcripts
- •Use the exploration findings to inform the plan
Lesson: Agent failures don't mean zero output — always check the transcript for partial results.
❌ Attempted to Implement P1-3 Immediately
What happened:
- •Started implementing P1-3 (tier-specific metrics) which required loader extension
- •User requested to file GitHub issue instead of implementing
- •Entered plan mode to design approach
What worked instead:
- •Used plan mode to thoroughly explore the codebase
- •Designed complete implementation strategy
- •Filed comprehensive GitHub issue #314 with full plan
- •Moved on to simpler tasks that could be completed immediately
Lesson: Not all tasks need immediate implementation — complex tasks benefit from being filed as well-planned GitHub issues.
❌ Tried to Run Tests from Wrong Directory
What happened:
- •Attempted to commit from
tests/unit/analysis/instead of project root - •Git commands failed with "pathspec did not match any files"
What worked instead:
cd /home/mvillmow/ProjectScylla git add docs/dev/architecture-review-implementation.md git commit -m "..."
Lesson: Always cd to project root before git operations, especially when working directory changes during exploration.
Results & Parameters
Completion Statistics
- •
Tasks completed: 8/10 (80%)
- •P0-1: impl_rate routing fix
- •P1-1: Config-driven colors
- •P1-2: Config-driven table precision
- •P1-5: all exports
- •P1-6: kruskal_wallis guard
- •P1-7: 20 new tests
- •P1-8: Non-computable metrics documentation
- •
GitHub issues filed: 3
- •
Pull requests created: 2
- •PR #311: P0-1, P1-1, P1-2 (merged)
- •PR #317: P1-5, P1-6, P1-7, P1-8 (auto-merge enabled)
- •
Test coverage: 119/119 passing (99 existing + 20 new)
Files Modified
Implementation (7 files):
- •
scripts/generate_figures.py- impl_rate routing - •
scylla/analysis/config.yaml- color categories - •
scylla/analysis/config.py- properties, all - •
scylla/analysis/figures/__init__.py- config colors - •
scylla/analysis/tables/*.py- precision format strings (3 files) - •
scylla/analysis/dataframes.py- all - •
scylla/analysis/stats.py- all, kruskal_wallis guard
Tests (4 files):
- •
tests/unit/analysis/test_dataframes.py(+88 lines, 4 tests) - •
tests/unit/analysis/test_loader.py(+198 lines, 8 tests) - •
tests/unit/analysis/test_figures.py(+112 lines, 8 tests)
Documentation (2 files):
- •
.claude/shared/metrics-definitions.md(+314 lines) - •
docs/dev/architecture-review-implementation.md(tracking doc)
Agent Configuration
Parallel agents used:
- •
a5b45c9 (Explore): Add tests for untested functions
- •Status: Completed successfully
- •Output: 20 tests across 3 files
- •Duration: ~6 minutes
- •
a0284dc (Explore): Document non-computable metrics
- •Status: Completed successfully
- •Output: +314 lines documentation
- •Duration: ~4 minutes
Failed agents (still produced useful output):
- •
ae3e20f (Explore): Loader implementation analysis
- •Status: Failed ("classifyHandoffIfNeeded is not defined")
- •Useful output: Complete loader structure analysis (retrieved from transcript)
- •
a88b03e (Explore): Dataframes and stats analysis
- •Status: Failed (same error)
- •Useful output: Full column list, function inventory (retrieved from transcript)
Git Workflow
# Branch structure main ├── architecture-review-p0-p1-fixes (PR #311 - merged) └── fix/analysis-pipeline-review (PR #317 - active) # Commit pattern git checkout -b fix/analysis-pipeline-review git commit -m "feat(analysis): Add __all__ exports and kruskal_wallis min sample guard" git commit -m "feat(analysis): Complete P1-7 and P1-8 architecture review tasks" git commit -m "docs(architecture): Update review tracking with GitHub issues" git push origin fix/analysis-pipeline-review gh pr create --title "..." --body "..." --label "enhancement" gh pr merge --auto --rebase
Test Verification
pixi run -e analysis pytest tests/unit/analysis/ -v # 119 passed, 1 warning in 3.92s pixi run -e analysis pytest tests/unit/analysis/test_dataframes.py tests/unit/analysis/test_loader.py tests/unit/analysis/test_figures.py -v # Verify new tests: 4 + 8 + 8 = 20 new tests
References
- •Original architecture review plan:
/.claude/projects/-home-mvillmow-ProjectScylla/fef6316f-194a-4d72-b335-52213ae9100d.jsonl - •Implementation plan (P1-3):
/home/mvillmow/.claude/plans/crispy-riding-torvalds.md - •Tracking document:
/home/mvillmow/ProjectScylla/docs/dev/architecture-review-implementation.md - •PR #311: https://github.com/HomericIntelligence/ProjectScylla/pull/311
- •PR #317: https://github.com/HomericIntelligence/ProjectScylla/pull/317
- •Issue #314: https://github.com/HomericIntelligence/ProjectScylla/issues/314
- •Issue #315: https://github.com/HomericIntelligence/ProjectScylla/issues/315
- •Issue #316: https://github.com/HomericIntelligence/ProjectScylla/issues/316
Key Takeaways
- •Parallel execution maximizes throughput — Launch independent agents simultaneously
- •Not all tasks need immediate implementation — Complex tasks benefit from GitHub issues with comprehensive plans
- •Agent failures ≠ zero output — Check transcripts for partial results
- •Structured tracking documents demonstrate progress and publication readiness
- •Plan mode is essential for large tasks before filing issues
- •Always work from project root for git operations
- •Link everything — PRs to issues, issues to tracking docs, commits to issues