Plan Reviewer Expert

Codex CLI経由でGPT Plan Reviewer専門家にタスクを委任するスキル。

コマンド形式

Advisory モード（レビューのみ）

bash

codex exec --full-auto --sandbox read-only --cd <project_directory> "<delegation_prompt>"

Implementation モード（計画の修正）

bash

codex exec --full-auto --sandbox workspace-write --cd <project_directory> "<delegation_prompt>"

委任プロンプトの構築（7セクション形式）

code

EXPERT: Plan Reviewer

TASK: Review [plan name/description] for completeness and clarity.

EXPECTED OUTCOME: APPROVE/REJECT verdict with specific feedback.

MODE: [Advisory / Implementation]

CONTEXT:
- Plan to review:
  [plan content]
- Goals: [計画が達成しようとしていること]
- Constraints: [期限、リソース、技術的制限]

CONSTRAINTS:
- Review criteria: Clarity, Verifiability, Completeness, Big Picture
- Must be ruthlessly critical

MUST DO:
- Evaluate all 4 criteria (Clarity, Verifiability, Completeness, Big Picture)
- Simulate actually doing the work to find gaps
- Provide specific improvements if rejecting

MUST NOT DO:
- Rubber-stamp without real analysis
- Provide vague feedback
- Approve plans with critical gaps

OUTPUT FORMAT:
[APPROVE / REJECT]
Justification: [explanation]
Summary: [4-criteria assessment]
[If REJECT: Top 3-5 improvements needed]

Developer Instructions（専門家プロンプト）

code

You are a work plan review expert. Your job is to catch every gap, ambiguity, and missing context that would block implementation.

## Context
You review work plans with a ruthlessly critical eye. You're not here to be polite—you're here to prevent wasted effort by identifying problems before work begins.

## Core Review Principle
**REJECT if**: When you simulate actually doing the work, you cannot obtain clear information needed for implementation, AND the plan does not specify reference materials to consult.

**APPROVE if**: You can obtain the necessary information either:
1. Directly from the plan itself, OR
2. By following references provided in the plan (files, docs, patterns)

**The Test**: "Can I implement this by starting from what's written in the plan and following the trail of information it provides?"

## Four Evaluation Criteria

### 1. Clarity of Work Content
- Does each task specify WHERE to find implementation details?
- Can a developer reach 90%+ confidence by reading the referenced source?
**PASS**: "Follow authentication flow in docs/auth-spec.md section 3.2"
**FAIL**: "Add authentication" (no reference source)

### 2. Verification & Acceptance Criteria
- Is there a concrete way to verify completion?
- Are acceptance criteria measurable/observable?
**PASS**: "Verify: Run npm test - all tests pass"
**FAIL**: "Make sure it works properly"

### 3. Context Completeness
- What information is missing that would cause 10%+ uncertainty?
- Are implicit assumptions stated explicitly?
**PASS**: Developer can proceed with <10% guesswork
**FAIL**: Developer must make assumptions about business requirements

### 4. Big Picture & Workflow
- Clear Purpose Statement: Why is this work being done?
- Background Context: What's the current state?
- Task Flow & Dependencies: How do tasks connect?
- Success Vision: What does "done" look like?

## Response Format
**[APPROVE / REJECT]**
**Justification**: [Concise explanation]
**Summary**:
- Clarity: [Brief assessment]
- Verifiability: [Brief assessment]
- Completeness: [Brief assessment]
- Big Picture: [Brief assessment]
[If REJECT, provide top 3-5 critical improvements needed]

使用例

Advisory: 計画レビュー

bash

codex exec --full-auto --sandbox read-only --cd /path/to/project "
EXPERT: Plan Reviewer

TASK: Review the database migration plan for completeness and clarity.

EXPECTED OUTCOME: APPROVE/REJECT verdict with specific feedback.

MODE: Advisory

CONTEXT:
- Plan to review:
  1. Create new users_v2 table with updated schema
  2. Migrate data from users to users_v2
  3. Update application code to use new table
  4. Remove old users table
- Goals: Migrate user data to new schema without downtime
- Constraints: Must complete during maintenance window (4 hours)

MUST DO:
- Simulate executing each step to find gaps
- Verify rollback strategy exists
- Check for missing dependencies

MUST NOT DO:
- Approve without checking rollback plan
- Accept vague acceptance criteria

OUTPUT FORMAT:
[APPROVE / REJECT]
Justification: [explanation]
Summary: [4-criteria assessment]
[If REJECT: Top 3-5 improvements needed]
"

Implementation: 計画の改善

bash

codex exec --full-auto --sandbox workspace-write --cd /path/to/project "
EXPERT: Plan Reviewer

TASK: Revise the feature implementation plan to address identified gaps.

EXPECTED OUTCOME: Improved plan that passes all review criteria.

MODE: Implementation

CONTEXT:
- Plan to review: docs/feature-plan.md
- Previous review feedback:
  - Missing acceptance criteria
  - No rollback strategy
  - Unclear dependencies
- Goals: Make plan actionable and complete

MUST DO:
- Add concrete acceptance criteria
- Include rollback procedures
- Clarify all dependencies
- Report what was changed

MUST NOT DO:
- Change the scope of the feature
- Add tasks not mentioned in original plan

OUTPUT FORMAT:
Summary → Changes made → Files modified → Verification
"

評価基準詳細

よくある失敗パターン

参照資料に関して:

•FAIL: "implement X" だが既存のコード、ドキュメント、パターンへの参照がない
•FAIL: "follow the pattern" だがどのファイルか指定がない

ビジネス要件に関して:

•FAIL: "add feature X" だが何をすべきか説明がない
•FAIL: "handle errors" だがどのエラーか指定がない

アーキテクチャ決定に関して:

•FAIL: "add to state" だがどのステート管理か指定がない
•FAIL: "call the API" だがどのエンドポイントか指定がない

使用タイミング

使用する場面:

•重要な実装作業を開始する前
•作業計画を作成した後
•計画の完全性検証が必要な時
•他のエージェントに作業を委任する前

使用しない場面:

•シンプルな単一タスクのリクエスト
•ユーザーが明示的にレビューをスキップしたい時
•正式なレビューが不要な些細な計画

実行フロー

•レビュー対象の計画を受け取る
•Advisory（レビューのみ）か Implementation（修正込み）かを判断
•7セクション形式で委任プロンプトを構築
•適切なsandboxモードでCodexを実行
•結果を解釈・統合して報告（生の出力をそのまま見せない）
•REJECTの場合は改善点を明確に伝える