/autolab - Autolab Workflow Operator
Use this skill when the user wants to operate or troubleshoot an Autolab workflow.
Quick Triage (Top 3 Commands)
- •
autolab status - •
autolab verify --stage <stage> - •
autolab run
For package/scaffold upgrades, run: autolab update.
Decision Tree
- •If verification fails: fix artifacts/verifier contracts first, then rerun
autolab verify --stage <stage>. - •If stage does not advance but verification passes: inspect
review_result.json/ decision gates for that stage. - •If repeated retries or escalations: inspect guardrails and retry policy in
.autolab/verifier_policy.yaml. - •If SLURM is involved: validate
run_manifest.json+docs/slurm_job_list.mdbefore retrying orchestration.
Table Of Contents
- •Goal
- •Stage flow and contracts
- •Command resolution
- •Read-first context checklist
- •Mode and cadence selection
- •Common tasks
- •Failure playbooks
- •Safe defaults
Goal
Help the user execute Autolab safely and efficiently by:
- •choosing the right runtime mode (
standardvsassistant) - •selecting run cadence (
runvsloop --auto) - •applying policy knobs in
.autolab/verifier_policy.yaml - •diagnosing no-transition / retry / escalation outcomes
See also: docs/workflow_modes.md for mode boundaries.
Stage Flow And Contracts
hypothesis -> design -> implementation -> implementation_review -> launch -> slurm_monitor -> extract_results -> update_docs -> decide_repeat
Verifier categories below are registry capabilities; policy controls actual requirements. Use autolab explain <stage> for effective requirements.
- •
hypothesis:hypothesis.md; categoriesschema,prompt_lint - •
design:design.yaml; categoriesschema,prompt_lint - •
implementation:implementation_plan.md; categoriesdry_run,schema,prompt_lint - •
implementation_review:implementation_review.md,review_result.json; categoriesdry_run,schema,prompt_lint,consistency,env_smoke,docs_target_update - •
launch: launch script +runs/<run_id>/run_manifest.json; categoriesschema,prompt_lint,consistency,env_smoke - •
slurm_monitor:runs/<run_id>/run_manifest.json(+ ledger for SLURM); categoriesenv_smoke - •
extract_results:runs/<run_id>/metrics.json,analysis/summary.md; categoriesschema,prompt_lint,consistency,env_smoke - •
update_docs:docs_update.md; categoriesschema,prompt_lint,consistency,docs_target_update - •
decide_repeat:decision_result.json; categoriesschema,prompt_lint,consistency
Command Resolution
Use this order:
- •
autolab ... - •
python -m autolab ... - •
PYTHONPATH=src python -m autolab ...
Read-First Context Checklist
Inspect before recommending changes:
- •
.autolab/state.json - •
.autolab/backlog.yaml - •
.autolab/verifier_policy.yaml - •
docs/todo.md - •stage artifacts under
experiments/<type>/<iteration_id>/
Prefer autolab status first.
Mode And Cadence Selection
Standard mode
Use for deterministic stage-machine control and verifier/debug loops.
- •one-step:
autolab run - •with pre-verify:
autolab run --verify
Assistant mode
Use for task-driven cycles (select -> implement -> verify -> review).
- •one-step:
autolab run --assistant - •unattended:
autolab loop --assistant --auto --max-hours <h> --max-iterations <n>
Decision handling
At decide_repeat:
- •explicit:
autolab run --decision hypothesis|design|stop|human_review - •auto:
autolab run --auto-decision
Common Tasks
Upgrade autolab
- •Run
autolab update. - •Confirm output shows
latest_tagand either:- •
action: already up to date, or - •
action: upgradingplus scaffold sync status.
- •
Stuck stage
- •
autolab status - •
autolab verify --stage <stage> - •Fix the failing artifact(s) or policy mismatch.
- •Re-run
autolab run.
SLURM issues
- •Inspect manifest:
cat experiments/<type>/<iteration_id>/runs/<run_id>/run_manifest.json - •Verify ledger:
autolab slurm-job-list verify --manifest <manifest> --doc docs/slurm_job_list.md - •Repair ledger if needed:
autolab slurm-job-list append --manifest <manifest> --doc docs/slurm_job_list.md - •Re-run
autolab verify --stage launchorautolab verify --stage slurm_monitor.
Assistant loop issues
- •Check
assistant_mode,task_cycle_stage, and guardrail counters viaautolab status. - •Confirm meaningful-change policy (
autorun.meaningful_change.*). - •Ensure backlog/todo tasks are actionable.
- •If repeated churn occurs, reduce automation scope and escalate to
human_review.
Policy misconfiguration
- •Run
autolab configure --check. - •Inspect
requirements_by_stagevs workflow capabilities. - •Validate
python_bin,dry_run_command, and retry policies. - •Apply minimal corrective edits; rerun verification.
Failure Playbooks
prompt_lint fails
- •
autolab verify --stage <stage> - •Inspect
.autolab/prompts/stage_<stage>.mdand.autolab/workflow.yamltoken contracts. - •Fix unsupported/missing tokens and rerun verification.
schema_checks fails
- •
autolab verify --stage <stage> - •
python .autolab/verifiers/schema_checks.py --stage <stage> --json - •Fix required fields/types against
.autolab/schemas/*.schema.json.
docs_targets fails
Ensure docs_update.md includes:
- •exact
metrics.jsonandrun_manifest.jsonartifact paths - •primary metric name, value, and delta
- •explicit no-target rationale when
paper_targetsis empty
Safe Defaults
- •Do not manually edit
.autolab/state.jsonto force transitions. - •Keep
docs/todo.mdin Markdown format. - •Keep guardrails enabled in unattended mode.
- •Prefer bounded loops (
--max-iterations,--max-hours). - •Use
auto_commit.mode: meaningful_onlyunless explicitly overridden.
Review Result Contract
review_result.json.required_checks is the fixed 5-key map:
- •
tests - •
dry_run - •
schema - •
env_smoke - •
docs_target_update
Do not add extra keys; use .autolab/verification_result.json for additional verifier evidence.