examples-auto-run
What it does
- Runs `uv run examples/run_examples.py` with:
  - `EXAMPLES_INTERACTIVE_MODE=auto` (auto-input/auto-approve).
  - Per-example logs under `.tmp/examples-start-logs/`.
  - Main summary log path passed via `--main-log` (also under `.tmp/examples-start-logs/`).
  - Generates a rerun list of failures at `.tmp/examples-rerun.txt` when `--write-rerun` is set.
- Provides start/stop/status/logs/tail/collect/rerun helpers via `run.sh`.
- Background option keeps the process running with a pidfile; `stop` cleans it up.
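The pidfile pattern mentioned above can be sketched roughly as follows. This is an illustration only, not the contents of `run.sh`: the `PIDFILE` path and the function names `start_bg`, `is_running`, and `stop_bg` are hypothetical.

```bash
# Hypothetical pidfile location; run.sh may use a different path.
PIDFILE="${PIDFILE:-.tmp/examples-auto-run.pid}"

# Launch a command in the background and record its PID.
start_bg() {
  mkdir -p "$(dirname "$PIDFILE")"
  "$@" >/dev/null 2>&1 &
  echo "$!" > "$PIDFILE"
}

# Succeeds only if the pidfile is non-empty and the recorded process is alive.
# kill -0 sends no signal; it only checks that the PID can be signalled.
is_running() {
  [ -s "$PIDFILE" ] && kill -0 "$(cat "$PIDFILE")" 2>/dev/null
}

# Terminate the recorded process (if still alive) and remove the pidfile.
stop_bg() {
  if is_running; then
    kill "$(cat "$PIDFILE")"
  fi
  rm -f "$PIDFILE"
}
```

Removing the pidfile even when the process has already exited keeps a stale file from making a later status check report a job that is no longer there.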
Usage
```bash
# Start (auto mode; interactive included by default)
.agents/skills/examples-auto-run/scripts/run.sh start [extra args to run_examples.py]
# Examples:
.agents/skills/examples-auto-run/scripts/run.sh start --filter basic
.agents/skills/examples-auto-run/scripts/run.sh start --include-server --include-audio

# Check status
.agents/skills/examples-auto-run/scripts/run.sh status

# Stop running job
.agents/skills/examples-auto-run/scripts/run.sh stop

# List logs
.agents/skills/examples-auto-run/scripts/run.sh logs

# Tail latest log (or specify one)
.agents/skills/examples-auto-run/scripts/run.sh tail
.agents/skills/examples-auto-run/scripts/run.sh tail main_20260113-123000.log

# Collect rerun list from a main log (defaults to latest main_*.log)
.agents/skills/examples-auto-run/scripts/run.sh collect

# Rerun only failed entries from rerun file (auto mode)
.agents/skills/examples-auto-run/scripts/run.sh rerun
```
Defaults (overridable via env)
- `EXAMPLES_INTERACTIVE_MODE=auto`
- `EXAMPLES_INCLUDE_INTERACTIVE=1`
- `EXAMPLES_INCLUDE_SERVER=0`
- `EXAMPLES_INCLUDE_AUDIO=0`
- `EXAMPLES_INCLUDE_EXTERNAL=0`
- Auto-approvals in auto mode: `APPLY_PATCH_AUTO_APPROVE=1`, `SHELL_AUTO_APPROVE=1`, `AUTO_APPROVE_MCP=1`
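One common way to make defaults like these overridable from the environment is the `${VAR:=default}` expansion, which assigns a value only when the variable is unset or empty. The sketch below assumes that mechanism; the function name `apply_defaults` is hypothetical, and `run.sh` may implement this differently.

```bash
# Apply the documented defaults without clobbering values the caller exported.
apply_defaults() {
  : "${EXAMPLES_INTERACTIVE_MODE:=auto}"
  : "${EXAMPLES_INCLUDE_INTERACTIVE:=1}"
  : "${EXAMPLES_INCLUDE_SERVER:=0}"
  : "${EXAMPLES_INCLUDE_AUDIO:=0}"
  : "${EXAMPLES_INCLUDE_EXTERNAL:=0}"
  export EXAMPLES_INTERACTIVE_MODE EXAMPLES_INCLUDE_INTERACTIVE \
    EXAMPLES_INCLUDE_SERVER EXAMPLES_INCLUDE_AUDIO EXAMPLES_INCLUDE_EXTERNAL

  # Auto-approvals are listed for auto mode only (assumed to be conditional).
  if [ "$EXAMPLES_INTERACTIVE_MODE" = "auto" ]; then
    : "${APPLY_PATCH_AUTO_APPROVE:=1}"
    : "${SHELL_AUTO_APPROVE:=1}"
    : "${AUTO_APPROVE_MCP:=1}"
    export APPLY_PATCH_AUTO_APPROVE SHELL_AUTO_APPROVE AUTO_APPROVE_MCP
  fi
}
```

Under that assumption, a one-off override would be a prefix assignment on the command line, for example `EXAMPLES_INCLUDE_SERVER=1 .agents/skills/examples-auto-run/scripts/run.sh start`.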
Log locations
- Main logs: `.tmp/examples-start-logs/main_*.log`
- Per-example logs (from `run_examples.py`): `.tmp/examples-start-logs/<module_path>.log`
- Rerun list: `.tmp/examples-rerun.txt`
- Stdout logs: `.tmp/examples-start-logs/stdout_*.log`
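Given the naming scheme above, the most recent main log can be located by modification time. This is a minimal sketch: `LOG_DIR` and `latest_main_log` are hypothetical names, and the real `tail` and `collect` helpers may select the latest log another way.

```bash
# Directory that holds main_*.log, stdout_*.log, and the per-example logs.
LOG_DIR="${LOG_DIR:-.tmp/examples-start-logs}"

# Print the newest main log, or nothing if none exists yet.
latest_main_log() {
  # ls -t sorts newest first; stderr is dropped for the no-match case.
  ls -t "$LOG_DIR"/main_*.log 2>/dev/null | head -n 1
}
```

Called as `tail -f "$(latest_main_log)"`, this follows the newest main log. If no main log exists the function prints nothing, so check for empty output before passing it on.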
Notes
- The runner delegates to `uv run examples/run_examples.py`, which already writes per-example logs and supports `--collect`, `--rerun-file`, and `--print-auto-skip`.
- `start` uses `--write-rerun` so failures are captured automatically.
- If `.tmp/examples-rerun.txt` exists and is non-empty, invoking the skill with no args runs `rerun` by default.
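The no-argument behavior in the last note can be expressed with the shell's `-s` test, which is true only for a file that exists and has a size greater than zero. In this sketch, `RERUN_FILE` and `default_action` are hypothetical names, and falling back to `start` when there is nothing to rerun is an assumption rather than documented behavior.

```bash
RERUN_FILE="${RERUN_FILE:-.tmp/examples-rerun.txt}"

# Decide which subcommand to run when the skill is invoked with no arguments.
default_action() {
  if [ -s "$RERUN_FILE" ]; then
    echo rerun
  else
    # Assumption: with no pending failures, a fresh start is the fallback.
    echo start
  fi
}
```

Note that `-s` counts a file containing only whitespace or blank lines as non-empty, so a stricter check would be needed if the rerun list can end up in that state.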
Behavioral validation (Codex/LLM responsibility)
The runner does not perform any automated behavioral validation. After every foreground start or rerun, Codex must manually validate all exit-0 entries:
- Read the example source (and comments) to infer intended flow, tools used, and expected key outputs.
- Open the matching per-example log under `.tmp/examples-start-logs/`.
- Confirm the intended actions/results occurred; flag omissions or divergences.
- Do this for all passed examples, not just a sample.
- Report immediately after the run with concise citations to the exact log lines that justify the validation.
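For the citation step, `grep -n` gives line numbers that can be quoted directly in the report. The helper below is a hypothetical convenience, not part of `run.sh`; the search pattern has to come from reading the example source, since the runner does not know what each example is expected to print.

```bash
# usage: cite_lines <log-file> <extended-regex>
# Prints matches as <file>:<line>:<text> so they can be pasted as citations.
cite_lines() {
  # -n adds line numbers, -H forces the file name even for a single file,
  # -E enables extended regular expressions.
  grep -nHE -- "$2" "$1"
}
```

A call would look like `cite_lines .tmp/examples-start-logs/<module_path>.log '<expected output pattern>'`. A non-zero exit status means the pattern was not found, which is itself worth flagging as a possible divergence.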