Edge Eval
Run eval snapshots, invariant checks, and reports.
Usage
- $edge-eval snapshot
- $edge-eval check
- $edge-eval run-bank
- $edge-eval report
Instructions
- Load active_context.yaml and read the evals config.
- If evals.enabled is false, explain and stop.
- For snapshot:
  - Capture state_before in .proof/evals/<date>/run-<n>/before.json
- For check:
  - Capture state_after, compute diff, run invariant checks
  - Log eval_run entry to .proof/session_log.jsonl
- For run-bank:
  - Iterate evals.task_bank (default trials = evals.trials.count)
  - Run snapshot+check for each trial
  - Summarize pass/fail
- For report:
  - Summarize last eval_run (pass/fail + failed invariants)
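The check and report steps can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: it assumes states are flat dictionaries and invariants are named predicates. The function names (compute_diff, run_check, summarize_last) and the entry fields (type, passed, failed_invariants, diff) are hypothetical. Only the eval_run entry name and the .proof/session_log.jsonl path come from the instructions above.

```python
import json
from pathlib import Path
from typing import Any, Callable

State = dict[str, Any]
# Hypothetical invariant shape: (before, after, diff) -> True when it holds.
Invariant = Callable[[State, State, dict], bool]


def compute_diff(before: State, after: State) -> dict:
    """Return the keys added, removed, or changed between two states."""
    added = {k: after[k] for k in sorted(after.keys() - before.keys())}
    removed = {k: before[k] for k in sorted(before.keys() - after.keys())}
    changed = {
        k: {"before": before[k], "after": after[k]}
        for k in sorted(before.keys() & after.keys())
        if before[k] != after[k]
    }
    return {"added": added, "removed": removed, "changed": changed}


def run_check(before: State, after: State,
              invariants: dict[str, Invariant], log_path: Path) -> dict:
    """Diff the states, evaluate every invariant, append one eval_run entry."""
    diff = compute_diff(before, after)
    failed = sorted(
        name for name, holds in invariants.items()
        if not holds(before, after, diff)
    )
    # Field names below are assumptions made for this sketch.
    entry = {
        "type": "eval_run",
        "passed": not failed,
        "failed_invariants": failed,
        "diff": diff,
    }
    log_path.parent.mkdir(parents=True, exist_ok=True)
    with log_path.open("a", encoding="utf-8") as handle:
        handle.write(json.dumps(entry, sort_keys=True) + "\n")
    return entry


def summarize_last(log_path: Path) -> str:
    """Summarize the most recent eval_run entry in the JSONL log."""
    last = None
    if log_path.exists():
        for line in log_path.read_text(encoding="utf-8").splitlines():
            if not line.strip():
                continue
            record = json.loads(line)
            if record.get("type") == "eval_run":
                last = record
    if last is None:
        return "no eval_run entries found"
    if last["passed"]:
        return "PASS"
    return "FAIL: " + ", ".join(last["failed_invariants"])
```

A run-bank loop could call run_check once per trial and count the entries whose passed field is true. Sorting keys and invariant names keeps the output stable across runs, in line with the deterministic-checks warning.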
Warnings:
- Deterministic checks only.
- If snapshots are truncated, skip invariants and surface a warning.
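One way the truncation rule might look in code is sketched below. How a snapshot signals truncation is not specified here, so the boolean "truncated" field is purely an assumption, as are the names is_truncated and check_with_guard and the shape of the returned result.

```python
import warnings
from typing import Any, Callable

Snapshot = dict[str, Any]
# Hypothetical invariant shape for this sketch: (before, after) -> bool.
Invariant = Callable[[Snapshot, Snapshot], bool]


def is_truncated(snapshot: Snapshot) -> bool:
    # Assumption: a truncated snapshot carries a boolean "truncated" flag.
    return bool(snapshot.get("truncated", False))


def check_with_guard(before: Snapshot, after: Snapshot,
                     invariants: dict[str, Invariant]) -> dict:
    """Skip invariant checks and warn if either snapshot is truncated."""
    if is_truncated(before) or is_truncated(after):
        warnings.warn(
            "snapshot truncated; invariant checks skipped", stacklevel=2
        )
        return {"status": "skipped", "failed_invariants": []}
    failed = sorted(
        name for name, holds in invariants.items()
        if not holds(before, after)
    )
    return {"status": "fail" if failed else "pass",
            "failed_invariants": failed}
```

Returning a distinct "skipped" status, rather than a pass, keeps a truncated run from being read as a clean result.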