Reviews and validates work across sessions for consistency and correctness.
Review Protocol (The Multiplexer)
Execute §CMD_EXECUTE_SKILL_PHASES.
Session Parameters
{
"taskType": "RESOLVE",
"phases": [
{"label": "0", "name": "Setup",
"steps": ["§CMD_PARSE_PARAMETERS", "§CMD_SELECT_MODE", "§CMD_INGEST_CONTEXT_BEFORE_WORK"],
"commands": ["§CMD_FIND_TAGGED_FILES"],
"proof": ["mode", "session_dir", "parameters_parsed", "debriefs_discovered"]},
{"label": "1", "name": "Discovery",
"steps": [],
"commands": ["§CMD_APPEND_LOG"],
"proof": ["debriefs_read", "sibling_logs_plans_read", "debrief_cards_logged", "cross_session_checks", "conflicts_found"]},
{"label": "2", "name": "Dashboard & Interrogation",
"steps": ["§CMD_INTERROGATE"],
"commands": ["§CMD_ASK_ROUND", "§CMD_LOG_INTERACTION"],
"proof": ["dashboard_presented", "depth_chosen", "rounds_completed", "debriefs_reviewed", "verdicts_tagged", "log_entries"]},
{"label": "3", "name": "Synthesis",
"steps": ["§CMD_RUN_SYNTHESIS_PIPELINE"], "commands": [], "proof": []},
{"label": "3.1", "name": "Checklists",
"steps": ["§CMD_VALIDATE_ARTIFACTS", "§CMD_RESOLVE_BARE_TAGS", "§CMD_PROCESS_CHECKLISTS"], "commands": [], "proof": []},
{"label": "3.2", "name": "Debrief",
"steps": ["§CMD_GENERATE_DEBRIEF"], "commands": [], "proof": ["debrief_file", "debrief_tags"]},
{"label": "3.3", "name": "Pipeline",
"steps": ["§CMD_MANAGE_DIRECTIVES", "§CMD_PROCESS_DELEGATIONS", "§CMD_DISPATCH_APPROVAL", "§CMD_CAPTURE_SIDE_DISCOVERIES", "§CMD_MANAGE_ALERTS", "§CMD_REPORT_LEFTOVER_WORK"], "commands": [], "proof": []},
{"label": "3.4", "name": "Close",
"steps": ["§CMD_REPORT_ARTIFACTS", "§CMD_REPORT_SUMMARY", "§CMD_CLOSE_SESSION", "§CMD_PRESENT_NEXT_STEPS"], "commands": [], "proof": []}
],
"nextSkills": ["/implement", "/document", "/brainstorm", "/analyze", "/chores"],
"directives": [],
"logTemplate": "assets/TEMPLATE_REVIEW_LOG.md",
"debriefTemplate": "assets/TEMPLATE_REVIEW.md",
"requestTemplate": "assets/TEMPLATE_REVIEW_REQUEST.md",
"responseTemplate": "assets/TEMPLATE_REVIEW_RESPONSE.md",
"modes": {
"quality": {"label": "Quality", "description": "Thorough validation, evidence-driven", "file": "modes/quality.md"},
"progress": {"label": "Progress", "description": "Cross-session status reporting", "file": "modes/progress.md"},
"evangelize": {"label": "Evangelize", "description": "Stakeholder communication, narrative", "file": "modes/evangelize.md"},
"custom": {"label": "Custom", "description": "User-defined", "file": "modes/custom.md"}
}
}
0. Setup
§CMD_REPORT_INTENT:
Reviewing ___ session debriefs. Mode: ___. Scope: ___ tagged files discovered. Focus: session activation, mode selection, debrief discovery.
§CMD_EXECUTE_PHASE_STEPS(0.0.*)
- •Scope: Identify unvalidated debriefs (
#needs-reviewand#needs-rework) and review delegation requests.
Debrief & Request Discovery (§CMD_FIND_TAGGED_FILES):
- •Search
sessions/directory for#needs-reviewand#needs-reworktags. - •Glob for
sessions/**/REVIEW_REQUEST_*.mdto find review delegation requests. - •Merge and deduplicate by path. Output all found files with their source (tag-discovered or REQUEST file).
Mode Selection (§CMD_SELECT_MODE):
On selection: Read the corresponding modes/{mode}.md file. It defines Role, Goal, Mindset, and Review Strategy.
On "Custom": Read ALL 3 named mode files first (modes/quality.md, modes/progress.md, modes/evangelize.md), then accept user's framing. Parse into role/goal/mindset.
Record: Store the selected mode. It configures:
- •Phase 0 role (from mode file)
- •Phase 2 review criteria (from mode file)
Model Selection (after mode selection):
Execute §CMD_SUGGEST_EXTERNAL_MODEL with:
modelQuestion: "Use an external model for writing the review report instead of Claude?"
Records externalModel (model name or "claude").
Phase Transition
Phase 0 always proceeds to Phase 1 -- no transition question needed.
1. Discovery & Cross-Session Analysis
Read everything. Build the global picture.
§CMD_REPORT_INTENT:
Reading ___ debriefs and their sibling logs. Building debrief cards and cross-session conflict analysis.
§CMD_EXECUTE_PHASE_STEPS(1.0.*)
Action:
- •Read All Debriefs: For each discovered file, read the full debrief.
- •Read All Logs: For each debrief's session directory, also read:
- •The
_LOG.mdfile(s) (e.g.,IMPLEMENTATION_LOG.md,BRAINSTORM_LOG.md) - •Any
_PLAN.mdfiles for additional context. - •Goal: Catch buried concerns, rejected alternatives, and internal reasoning not surfaced in the debrief.
- •The
- •Log Debrief Cards: For each debrief, execute
§CMD_APPEND_LOGwith theDebrief Cardschema. - •Cross-Session Analysis: Analyze ALL debriefs together against the 4 cross-session checks:
- •File Overlap: Did multiple sessions touch the same files?
- •Schema/Interface Conflicts: Did sessions make incompatible changes to shared types?
- •Contradictory Decisions: Did session A decide X while session B decided not-X?
- •Dependency Order: Did any session depend on another session's unvalidated output?
- •Log Conflicts: For each finding, execute
§CMD_APPEND_LOGwith theCross-Session Conflictschema.
Do NOT present findings to the user yet. Complete the full analysis first.
Phase Transition
Execute §CMD_GATE_PHASE:
custom: "Skip to Phase 3: Synthesis | I already know the verdicts, just write the report"
2. Dashboard & Per-Debrief Interrogation
Phase 2a: The Dashboard
Present the global picture first.
§CMD_REPORT_INTENT:
Presenting dashboard for ___ debriefs and ___ cross-session findings. Awaiting direction before per-debrief drill-down.
§CMD_EXECUTE_PHASE_STEPS(2.0.*)
Action:
- •Present Cross-Session Findings: Output the cross-session analysis (file overlaps, conflicts, contradictions, dependencies). If none found, say "No cross-session conflicts detected."
- •Present All Summary Cards: For each debrief, output a summary card:
- •Session:
[Session Dir] - •File:
[Debrief Filename](#needs-review|#needs-rework) - •Goal: 1-line session goal.
- •What Was Done: 2-3 bullet points.
- •Files Touched: Key files.
- •Risk Flags: Any concerns from log inspection. "None" if clean.
- •Agent Confidence: High/Medium/Low (inferred from debrief tone and log).
- •Session:
- •If
#needs-rework: Include the previous## Rework Notescontent and ask: "This was previously rejected. Has the underlying work been redone?"
Execute AskUserQuestion (multiSelect: false):
"Dashboard presented. How to proceed?"
- •"Proceed to per-debrief review" -- Walk through each debrief individually
- •"Discuss dashboard first" -- I want to talk about the cross-session findings
Phase 2b: Per-Debrief Interrogation
Walk through each debrief and REQUEST file with the user.
Interrogation Depth Selection
Before starting per-debrief review, present this choice via AskUserQuestion (multiSelect: false):
"How thorough should the per-debrief review be?"
| Depth | Minimum Rounds Per Debrief | When to Use |
|---|---|---|
| Short | 1 round | Quick validation, trusted agent, small scope |
| Medium | 2 rounds | Standard review, moderate complexity |
| Long | 3+ rounds | Deep audit, critical changes, untrusted work |
| Absolute | Until ALL concerns resolved | High-risk sessions, production changes, security-sensitive |
Record the user's choice. This sets the minimum per debrief -- the agent can always ask more, and the user can always say "approve" after the minimum is met.
Topics (Review)
Examples of themes to explore per debrief. Adapt to the session -- skip irrelevant ones, invent new ones as needed.
Standard topics (typically covered once per debrief):
- •Acceptance criteria -- Did the session achieve what was asked?
- •Test coverage -- Were tests written/updated? Are they passing? Any skipped?
- •Breaking changes -- Did this session change interfaces, schemas, or APIs that other code depends on?
- •Documentation alignment -- Were relevant docs updated, or is there now a doc-code drift?
- •Risk flags -- Did the agent express concerns in the log that didn't make it to the debrief?
- •Buried alternatives -- Did the agent reject an option that the user should know about?
- •TODO completeness -- Anything explicitly deferred or marked as future work?
- •Cross-session consistency -- Does this session's output conflict with other concurrent sessions?
Repeatable topics (can be selected any number of times):
- •Followup -- Clarify or revisit answers from previous rounds
- •Devil's advocate -- Challenge assumptions and decisions made so far
- •What-if scenarios -- Explore hypotheticals, edge cases, and alternative futures
- •Deep dive -- Drill into a specific topic from a previous round in much more detail
Standard Validation Checklist (LLM Internal Guidance)
This is YOUR internal rubric -- do NOT present it as a form. Use it to generate contextualized findings.
- •Goal Alignment -- Did the session achieve what was asked?
- •Completeness -- Are all template sections filled? Any placeholders or "TBD" left?
- •Test Status -- Were tests written/updated? Are they passing? Any skipped?
- •Breaking Changes -- Did this session change interfaces, schemas, or APIs that other code depends on?
- •Risk Flags -- Did the agent express concerns/worries in the log that didn't make it to the debrief?
- •Buried Alternatives -- Did the agent reject an option in the log that the user should know about?
- •TODOs & Leftovers -- Anything explicitly deferred or marked as future work?
- •Doc Alignment -- Were relevant docs updated, or is there now a doc-code drift?
Per-Debrief Review Protocol
For each debrief (sequentially):
Round counter: Output it on each debrief: "Reviewing debrief N of M -- Round R / {depth_minimum}+"
- •
Analyze Against Checklist (Internal): Review the debrief against the Standard Validation Checklist. Filter and contextualize -- skip checks that aren't relevant. For each relevant check, prepare a specific, contextualized finding.
- •Bad: "Test Status: PASS" (meaningless)
- •Good: "Test Status: 164/164 estimate + 57/57 viewer tests pass. No skipped tests. The new
mutate()context has 3 dedicated tests covering capture, backwards compat, and before snapshot."
- •
Present to User: Use
AskUserQuestionwith structured options:- •Present your findings as detailed descriptions per option.
- •Ask targeted questions about anything ambiguous or concerning.
- •Always include these options:
- •"Approve (clean)" -- No issues found, validate the debrief
- •"Approve + note TODOs" -- Validate, but capture follow-up work
- •"Flag for rework" -- Reject, mark for rework with reason
- •"I have questions" -- Discuss before deciding
- •
Process User Response:
- •Approve (clean): Validate the debrief -- swap tags and log:
Record the verdict inbash
engine tag swap "$FILE" '#needs-review,#needs-rework' '#done-review'
REVIEW_LOG.md. LogVerdict: Validated. No leftovers. - •Approve + note TODOs: Validate (same tag swap as clean approve). Then immediately ask: "What follow-up work should I note down?" Capture the user's response and log it as
Leftover Spawned. These become micro-dehydrated prompts in the Leftovers section of the final REVIEW.md report. - •Flag for Rework: Mark the debrief as needing rework:
Then append abash
engine tag swap "$FILE" '#needs-review' '#needs-rework'
## Rework Notessection at the end with: the date of rejection, the user's stated reason, and specific items to address. If## Rework Notesalready exists, append a new dated entry under it. Record the verdict inREVIEW_LOG.md. Ask user for the rework reason. LogVerdict: Needs Rework. The leftover prompt generated for this rework MUST include the instruction:- •Execute
§CMD_SWAP_TAG_IN_FILEto replace#needs-reworkwith#done-review. - •Append a short resolution entry to the debrief's
## Rework Notessection:code### [YYYY-MM-DD] Resolved [1-2 lines describing what was done] See: [REVIEW.md](../YYYY_MM_DD_REVIEW_N/REVIEW.md)
- •Execute
- •Questions/Discussion: Answer the user's question fully, then re-present the approval question.
- •Approve (clean): Validate the debrief -- swap tags and log:
- •
Log: Execute
§CMD_LOG_INTERACTIONafter each user interaction. Execute§CMD_APPEND_LOGwith the verdict schema. - •
Repeat for next debrief.
Per-REQUEST File Review
For each discovered REVIEW_REQUEST_*.md file (after all debriefs are processed):
- •
Read REQUEST: Read the full REQUEST file. Extract structured fields: Topic, Context, Expectations, Requesting Session.
- •
Read Linked Context: If the REQUEST references a session or specific files, read them.
- •
Present to User: Show the REQUEST summary and ask:
- •"Fulfill request" -- Review the linked work and write a RESPONSE file
- •"Defer" -- Leave the REQUEST for a future review session
- •"Dismiss" -- Remove the REQUEST (tag swap to
#done-review)
- •
On "Fulfill request": Review the linked artifacts (debrief, code, etc.) using the Standard Validation Checklist. Then apply the same verdict options (Approve clean / Approve + TODOs / Flag for rework / Questions).
- •
Write RESPONSE: After the verdict, write a
REVIEW_RESPONSE_[TOPIC].mdin the review session directory using§CMD_WRITE_FROM_TEMPLATEwith theTEMPLATE_REVIEW_RESPONSE.mdtemplate. Also add a## Responsebreadcrumb section to the original REQUEST file:markdown## Response **Reviewed by**: `[review session dir]` **Verdict**: [Validated / Needs Rework] **Response file**: `[path to REVIEW_RESPONSE_*.md]`
- •
Tag the REQUEST: Swap the REQUEST file's tag:
- •Validated:
engine tag swap [REQUEST_FILE] '#needs-review' '#done-review' - •Needs rework:
engine tag swap [REQUEST_FILE] '#needs-review' '#needs-rework'
- •Validated:
Interrogation Exit Gate
After reaching minimum rounds for a debrief, present this choice via AskUserQuestion (multiSelect: true):
"Debrief review round complete (minimum met). What next?"
- •"Approve and move to next debrief" -- (terminal: if selected, skip all others and proceed)
- •"More review (1 more round)" -- Standard review round, then this gate re-appears
- •"Devil's advocate round" -- 1 round challenging the session's decisions, then this gate re-appears
- •"What-if scenarios round" -- 1 round exploring hypotheticals, then this gate re-appears
- •"Deep dive round" -- 1 round drilling into a prior topic, then this gate re-appears
Execution order (when multiple selected): Standard round first -> Devil's advocate -> What-ifs -> Deep dive -> re-present exit gate.
For Absolute depth: Do NOT offer the exit gate until you have zero remaining concerns. Ask: "Round N complete. I still have concerns about [X]. Continuing..."
3. Synthesis
When all debriefs are reviewed.
§CMD_REPORT_INTENT:
Synthesizing. ___ debriefs validated, ___ flagged for rework, ___ leftovers spawned. Producing REVIEW.md with verdicts and dehydrated leftover prompts.
§CMD_EXECUTE_PHASE_STEPS(3.0.*)
Debrief generation:
If externalModel is not "claude" (external model path):
- •Gather context file paths: Collect paths to all reviewed debriefs, their logs, REVIEW_LOG.md, DETAILS.md, and TEMPLATE_REVIEW.md. Do NOT read them into context.
- •Compose prompt: Describe the review template structure, all verdicts, cross-session findings, and leftover work items.
- •Execute
§CMD_EXECUTE_EXTERNAL_MODELwith:- •
prompt: The composed review synthesis instructions - •
template:TEMPLATE_REVIEW.mdpath - •
system:"You are a senior technical reviewer producing a structured review report. Output ONLY the document content in Markdown. Follow the template structure exactly." - •
contextFiles: All debrief, log, and details file paths
- •
- •Write the output as REVIEW.md and proceed with tagging.
If externalModel is "claude" (default path):
- •Cross-Session Analysis: Transcribe findings from Phase 1.
- •Per-Debrief Verdicts: Condensed card for each debrief with verdict.
- •Leftovers: For each rework item or discovered TODO, generate a micro-dehydrated prompt:
- •Simple tasks (delete a file, rename, small config change): Just a plain instruction. No command protocol needed.
- •Complex tasks (feature rework, bug investigation, test gaps): Recommend a command (
/implement,/fix,/test,/analyze) with a self-contained prompt referencing the review report and original session. - •Enough context for the user to copy-paste and immediately act.
Walk-through config:
§CMD_WALK_THROUGH_RESULTS Configuration: mode: "results" gateQuestion: "Review complete. Walk through the verdicts?" debriefFile: "REVIEW.md"