AgentSkillsCN

eng-verification

在认定工作已完成之前,必须执行强制性检查清单——先运行生成证据的命令,仔细检查输出结果,唯有确认无误后方可正式声明任务完成。

SKILL.md
--- frontmatter
name: eng-verification
description: Mandatory checklist before claiming work is complete—run the evidence-producing command, inspect output, and only then state a result.

Verification Before Completion

Overview

Claiming success without evidence is dishonesty, not efficiency. Every status report, “done,” or “tests pass” statement requires fresh verification.

Iron Law

code
NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE

If you didn’t run the verifying command in this message, you cannot claim it passes.

Gate Function

code
1. IDENTIFY: Which command proves the claim?
2. RUN: Execute the full command now (fresh run).
3. READ: Inspect entire output & exit code.
4. VERIFY: Does output prove the claim?
   - If NO → report actual status with evidence
   - If YES → state claim WITH evidence (command + summary)
5. ONLY THEN: Make the claim

Skipping any step = lying.

Common Claims & Required Evidence

ClaimRequiredNot Enough
“Tests pass”Latest test command output showing 0 failuresPrevious run, “should pass”
“Lint clean”Linter command exit 0 nowPartial subset, inference
“Build works”Build command exit 0Linter pass
“Bug fixed”Reproduction test now passesCode diff only
“Regression test exists”Red→Green verified cycleTest created but never failed
“Agent succeeded”Inspect VCS diff + verify outputsAgent success message

Red Flags – STOP

  • Saying “should/probably/seems”
  • Expressing satisfaction before running verification
  • Planning to commit/push/PR without a fresh run
  • Trusting agent reports without checking
  • Partial verification (“just ran affected tests”)
  • “Just this once” or “I’m tired” rationalizations

Rationalization Rebuttals

ExcuseReality
“Should work now”Run the command
“I’m confident”Confidence ≠ evidence
“Just this once”No exceptions
“Agent said success”Verify independently
“Partial check is fine”Partial proves nothing
“I’m exhausted”Exhaustion isn’t evidence

Patterns

  • Tests: run and show X/Y passing output.
  • Regression tests: demonstrate fail (red), apply fix, pass (green), revert to confirm failure if necessary.
  • Requirements: re-read spec, create checklist, tick items with evidence.
  • Agent work: inspect actual diff, run verification yourself.

When to Apply

Always before:

  • Claiming completion/success
  • Expressing satisfaction (“Great! Done!”)
  • Moving to next task
  • Committing, pushing, or filing PRs
  • Delegating completion to others/agents

Bottom Line

Evidence precedes claims. Run the command, read the output, then speak. Anything else breaks trust.