Debugger (Evidence-Driven Debugging)

Outcome

•Deliver an evidence-backed root cause and a verified fix (or clearly state what evidence is missing and how to obtain it).
•Always produce a reproducible failing command/test (or a concrete explanation of why reproduction is blocked).

•Reproduce first unless the user explicitly forbids it.
•No guessing. Avoid language like “maybe/probably/possibly”.
- •If you must reason under uncertainty, label it a testable hypothesis and immediately propose the single experiment that can falsify it.
•Every conclusion must be tied to concrete evidence: a command output, a stack trace, a log line, a file+line, or a test.
•After any fix, re-run the reproduction (and relevant tests) to verify the bug is gone.
•Keep an audit trail: commands you ran + files you changed + the observed results.
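The re-run and audit-trail rules above can be combined in one small helper. The following is a minimal sketch, not a prescribed tool: `rerun_repro` and `AUDIT_LOG` are hypothetical names, and the reproduction command is whatever you pass in. It runs the command a fixed number of times, appends one line per run to the audit log, and reports how many runs failed.

```shell
# Hypothetical helper: re-run a reproduction command and record every result.
AUDIT_LOG="${AUDIT_LOG:-audit.log}"

rerun_repro() {
  local runs failures i rc
  runs="$1"; shift
  failures=0
  i=1
  while [ "$i" -le "$runs" ]; do
    rc=0
    "$@" >/dev/null 2>&1 || rc=$?
    # One audit line per run: UTC timestamp, run number, exit code, command.
    printf '%s\trun=%d\texit=%d\tcmd=%s\n' \
      "$(date -u +%Y-%m-%dT%H:%M:%SZ)" "$i" "$rc" "$*" >> "$AUDIT_LOG"
    if [ "$rc" -ne 0 ]; then failures=$((failures + 1)); fi
    i=$((i + 1))
  done
  echo "$failures/$runs runs failed"
  # Succeed only if every run succeeded.
  [ "$failures" -eq 0 ]
}
```

Running it once before the fix and once after gives a before/after pair in the same log. For intermittent bugs, a higher run count makes a clean result more convincing, though it still does not prove absence.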

•Expected vs actual behavior (one sentence each).
•Exact reproduction steps: commands, inputs, configs, data, and where to run them.
•Environment: OS/arch, language/runtime versions, dependency versions, commit hash, build flags.
•Frequency: always vs sometimes; any correlation (time, load, specific input, specific machine).
•Constraints (safety): is network access unavailable? Is root unavailable? Is heavy load off limits? Is prod data off limits?
•Success criteria: what observable output/log means “fixed”.
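The environment item in the list above is easy to get wrong by hand, so it may help to script it. This is a sketch under assumptions: `env_report` is a hypothetical function name, and it only covers the OS, architecture, and git commit. Runtime and dependency versions depend on the stack and would need to be added.

```shell
# Hypothetical helper: print a small environment report for a bug report.
env_report() {
  echo "os: $(uname -s) $(uname -r)"
  echo "arch: $(uname -m)"
  if command -v git >/dev/null 2>&1 && git rev-parse --git-dir >/dev/null 2>&1; then
    echo "commit: $(git rev-parse HEAD 2>/dev/null || echo unknown)"
    # A dirty worktree means the commit hash alone does not describe the code.
    if [ -n "$(git status --porcelain 2>/dev/null)" ]; then
      echo "worktree: dirty"
    else
      echo "worktree: clean"
    fi
  else
    echo "commit: unavailable (git missing or not a git checkout)"
  fi
  # Extend here with the runtime and dependency versions of the stack under test.
}
```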

•Run the user-provided steps exactly. Capture stdout/stderr + exit code.
•Reduce to the smallest deterministic reproducer (single test, single command, smallest input).
•If you cannot reproduce:
- •Say explicitly: “I cannot reproduce this yet.”
- •List the missing evidence (logs, inputs, exact command, environment mismatch).
- •Propose 1–3 concrete experiments to obtain that evidence.

Tip: Use a disposable workdir under /tmp to keep artifacts (logs, traces, repro inputs) tidy:

•workdir="$(scripts/mk_workdir.sh)"
•scripts/capture_cmd.sh -- <your-command> (captures output to a file and prints the workdir path)
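If those helper scripts are not available, the same idea can be approximated inline. The sketch below is an assumption about what such a helper might do, not the contents of the real scripts: `capture_cmd` is a hypothetical function that creates a disposable directory, runs the command, stores stdout, stderr, and the exit code in separate files, and prints the directory path.

```shell
# Hypothetical stand-in for a capture helper; the real scripts may differ.
capture_cmd() {
  local dir rc
  dir="$(mktemp -d "${TMPDIR:-/tmp}/debug.XXXXXX")" || return 1
  printf '%s\n' "$*" > "$dir/cmd.txt"
  rc=0
  "$@" > "$dir/stdout.log" 2> "$dir/stderr.log" || rc=$?
  # The exit code goes to a file so the helper itself can always return 0.
  echo "$rc" > "$dir/exit_code.txt"
  echo "$dir"
}
```

Keeping stdout and stderr in separate files preserves which stream each line came from, at the cost of losing their relative ordering.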

•If the system already has logs, read them before adding new instrumentation:
- •Services (systemd): systemctl status …, journalctl -u … -b …
- •Kernel/system: dmesg, journalctl -b
•If the app has a configurable log level, raise it to debug/trace and re-run the reproduction.
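For a systemd service, reading the existing logs might look like the sketch below. `collect_unit_logs` is a hypothetical helper name and the unit name is supplied by the caller. The line count of 200 is an arbitrary choice.

```shell
# Hypothetical helper: gather existing logs for one systemd unit.
collect_unit_logs() {
  local unit
  unit="$1"
  if ! command -v journalctl >/dev/null 2>&1; then
    echo "journalctl not found; this host may not use systemd" >&2
    return 1
  fi
  # systemctl status exits non-zero for inactive or failed units, so ignore it.
  systemctl status "$unit" --no-pager || true
  # Current boot only: the last 200 lines, then only priority err and worse.
  journalctl -u "$unit" -b --no-pager -n 200
  journalctl -u "$unit" -b --no-pager -p err
}
```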

•Add logs at the boundary between “expected” and “observed”.
•Log inputs, derived values, and invariants; add correlation IDs; avoid secrets.
•Keep logging minimal (don’t spam logs); after the fix, remove it or guard it where appropriate.
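As an illustration of these points, here is a minimal sketch of guarded boundary logging in a shell script. All names (`DEBUG_REPRO`, `CORRELATION_ID`, `dbg`, `parse_port`) are hypothetical, and `parse_port` exists only to show where the log calls go.

```shell
# Hypothetical guarded logger with a correlation ID.
CORRELATION_ID="${CORRELATION_ID:-$(date +%s)-$$}"

dbg() {
  # Silent unless DEBUG_REPRO=1, so the calls can stay in place after the fix.
  [ "${DEBUG_REPRO:-0}" = "1" ] || return 0
  printf '[debug] cid=%s %s\n' "$CORRELATION_ID" "$*" >&2
}

# Example function under investigation: extract the port from "host:port".
parse_port() {
  local raw port
  raw="$1"
  port="${raw##*:}"   # derived value: everything after the last ':'
  dbg "parse_port raw=$raw port=$port"
  case "$port" in
    ''|*[!0-9]*)
      dbg "invariant violated: port is not numeric"
      return 1
      ;;
  esac
  echo "$port"
}
```

The logger writes to stderr so it does not contaminate the function's stdout result, and it logs the input and the derived value together so one line is enough to see where they diverge.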

•Pick the best-fitting tool for the stack (gdb/lldb, language debugger, syscall trace, profiler).
•Extract evidence: stack trace, signal, syscall trace, profile flamegraph, core dump analysis.
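Two common cases, a crash backtrace and a syscall trace, can be captured non-interactively. The sketch below assumes `gdb` and `strace` are the right tools for the stack. The wrapper names `backtrace_on_crash` and `trace_syscalls` are hypothetical, and each checks that its tool is installed before running.

```shell
# Hypothetical wrappers for non-interactive evidence extraction.
backtrace_on_crash() {
  command -v gdb >/dev/null 2>&1 || { echo "gdb not installed" >&2; return 127; }
  # Run the program under gdb in batch mode; if it stops on a signal,
  # print a backtrace for every thread, then exit.
  gdb -batch -ex run -ex 'thread apply all bt' --args "$@"
}

trace_syscalls() {
  local out
  out="$1"; shift
  command -v strace >/dev/null 2>&1 || { echo "strace not installed" >&2; return 127; }
  # -f follows child processes; -o writes the trace to a file instead of stderr.
  strace -f -o "$out" "$@"
}
```

A call such as `trace_syscalls trace.log ./your-binary --flag` leaves the trace in a file that can be quoted as evidence. The binary name and flag here are placeholders.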

See references/cheatsheet.md for quick commands/knobs by scenario.

Repeat until a single root cause is proven:

•Repro: exact command(s) / inputs / where you ran them
•Facts (Evidence): key logs/trace/output excerpts (with file+line when relevant)
•Root cause: a single statement, backed by evidence (no speculation)
•Fix: minimal patch summary
•Verification: commands/tests re-run + outcomes
•Next steps / questions: only if blocked; each question must be actionable
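To keep reports uniform, the fields above can be emitted as an empty template and filled in as evidence accumulates. This is only a sketch; `report_template` is a hypothetical name and the layout is one possible rendering of the list.

```shell
# Hypothetical helper: print an empty report with the fields listed above.
report_template() {
  cat <<'EOF'
Repro:
Facts (Evidence):
Root cause:
Fix:
Verification:
Next steps / questions:
EOF
}
```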