Systematic Debugging

Overview

Random fixes waste time and create new bugs. Quick patches mask underlying issues.

Core principle: ALWAYS find root cause before attempting fixes. Symptom fixes are failure.

Violating the letter of this process is violating the spirit of debugging.

NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST

If you haven't completed Phase 1, you cannot propose fixes.

Use for ANY technical issue:

Use this ESPECIALLY when:

You MUST complete each phase before proceeding to the next.

BEFORE attempting ANY fix:

Find the pattern before fixing:

• Find Working Examples
  - Locate similar working code in the same codebase
  - What works that's similar to what's broken?
• Compare Against References
  - If implementing a pattern, read the reference implementation COMPLETELY
  - Don't skim - read every line
  - Understand the pattern fully before applying
• Identify Differences
  - What's different between working and broken?
  - List every difference, however small
  - Don't assume "that can't matter"
• Understand Dependencies
  - What other components does this need?
  - What settings, config, environment?
  - What assumptions does it make?
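
Listing every difference between a working and a broken case can be mechanical. The sketch below assumes both cases can be described as flat dictionaries of settings; the function name and the sample settings are hypothetical, chosen only for illustration.

```python
def list_differences(working: dict, broken: dict) -> list[str]:
    """Return one line per key that differs or exists on only one side."""
    differences = []
    for key in sorted(working.keys() | broken.keys()):
        if key in working and key in broken:
            if working[key] != broken[key]:
                differences.append(
                    f"{key}: working={working[key]!r} broken={broken[key]!r}"
                )
        elif key in working:
            differences.append(f"{key}: only in working ({working[key]!r})")
        else:
            differences.append(f"{key}: only in broken ({broken[key]!r})")
    return differences


# Hypothetical settings for two similar services, one healthy and one failing.
working_config = {"timeout_s": 30, "retries": 3, "tls": True}
broken_config = {"timeout_s": 30, "retries": 0, "region": "eu-west-1"}

# Every difference is reported, including ones that "can't matter".
for line in list_differences(working_config, broken_config):
    print(line)
```

Nothing is filtered out on purpose: deciding which difference matters belongs to the hypothesis step, not to the comparison.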

Scientific method:

• Form Single Hypothesis
  - State clearly: "I think X is the root cause because Y"
  - Write it down
  - Be specific, not vague
• Test Minimally
  - Make the SMALLEST possible change to test hypothesis
  - One variable at a time
  - Don't fix multiple things at once
• Verify Before Continuing
  - Did it work? Yes -> Phase 4
  - Didn't work? Form NEW hypothesis
  - DON'T add more fixes on top
• When You Don't Know
  - Say "I don't understand X"
  - Don't pretend to know
  - Ask for help
  - Research more
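
One way to keep a test minimal is to start from a known-good baseline and apply each suspected change alone. This is a rough sketch, not a prescribed tool: `isolate_single_cause` and the `still_works` check are hypothetical stand-ins for whatever system and verification step are actually in play.

```python
from typing import Callable


def isolate_single_cause(
    baseline: dict,
    candidate_changes: dict,
    still_works: Callable[[dict], bool],
) -> list[str]:
    """Apply each candidate change alone to the baseline; return the ones that break it."""
    culprits = []
    for key, value in candidate_changes.items():
        # Exactly one variable differs from the known-good baseline per trial.
        trial = {**baseline, key: value}
        if not still_works(trial):
            culprits.append(key)
    return culprits


# Hypothetical check standing in for "run the system and see if it works".
def still_works(config: dict) -> bool:
    return config["retries"] > 0


baseline = {"timeout_s": 30, "retries": 3}
suspects = {"timeout_s": 5, "retries": 0}

print(isolate_single_cause(baseline, suspects, still_works))
```

Because each trial changes a single key, a failing trial points at one variable instead of a bundle of changes.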

Fix the root cause, not the symptom:

• Create Failing Test Case
  - Simplest possible reproduction
  - Automated test if possible
  - MUST have before fixing
• Implement Single Fix
  - Address the root cause identified
  - ONE change at a time
  - No "while I'm here" improvements
  - No bundled refactoring
• Verify Fix
  - Test passes now?
  - No other tests broken?
  - Issue actually resolved?
• If Fix Doesn't Work
  - STOP
  - Count: How many fixes have you tried?
  - If < 3: Return to Phase 1, re-analyze with new information
  - If >= 3: STOP and question the architecture
  - DON'T attempt Fix #4 without architectural discussion
• If 3+ Fixes Failed: Question Architecture
  - Is this pattern fundamentally sound?
  - Are we "sticking with it through sheer inertia"?
  - Should we refactor architecture vs. continue fixing symptoms?
  - Discuss with your human partner before attempting more fixes
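
As an illustration of "failing test first, then a single fix", here is a small sketch. The `page_count` function and its bug are invented for the example; the test functions use plain assertions so they can be called directly or collected by a runner such as pytest.

```python
def page_count(total_items: int, per_page: int) -> int:
    """Hypothetical function under test, shown after the single root-cause fix.

    Before the fix the body was `total_items // per_page`, which silently
    dropped a partial last page.
    """
    return -(-total_items // per_page)  # ceiling division


def test_partial_last_page_is_counted():
    # Simplest reproduction: 10 items at 3 per page need 4 pages.
    # Written BEFORE the fix, when it failed because the result was 3.
    assert page_count(10, 3) == 4


def test_exact_multiple_is_unchanged():
    # Guards against the fix breaking a case that already worked.
    assert page_count(9, 3) == 3


# Verify the fix: the reproduction passes and nothing else broke.
test_partial_last_page_is_counted()
test_exact_multiple_is_unchanged()
```

The fix is one changed expression and nothing more; renaming, refactoring, or other cleanup would go into a separate change.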

If you catch yourself thinking:

ALL of these mean: STOP. Return to Phase 1.

Phase	Key Activities	Success Criteria
1. Root Cause	Read errors, reproduce, check changes, gather evidence	Understand WHAT and WHY
2. Pattern	Find working examples, compare	Identify differences
3. Hypothesis	Form theory, test minimally	Confirmed or new hypothesis
4. Implementation	Create test, fix, verify	Bug resolved, tests pass
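
The phase ordering and the three-failed-fixes rule can also be written down as a small sketch. The names below (`Phase`, `can_enter`, `next_step_after_failed_fix`) are hypothetical; only the phase order and the threshold of three come from the process itself.

```python
from enum import IntEnum


class Phase(IntEnum):
    ROOT_CAUSE = 1
    PATTERN = 2
    HYPOTHESIS = 3
    IMPLEMENTATION = 4


MAX_FAILED_FIXES = 3  # threshold from the "3+ fixes failed" rule


def can_enter(phase: Phase, completed: set[Phase]) -> bool:
    """A phase may start only when every earlier phase is complete."""
    return all(earlier in completed for earlier in Phase if earlier < phase)


def next_step_after_failed_fix(failed_fixes: int) -> str:
    """Decide what to do after a fix did not work."""
    if failed_fixes < MAX_FAILED_FIXES:
        return "return to Phase 1 and re-analyze with the new information"
    return "stop and question the architecture before attempting another fix"


# No fix may be proposed while Phase 1 is incomplete.
print(can_enter(Phase.IMPLEMENTATION, {Phase.PATTERN, Phase.HYPOTHESIS}))
print(next_step_after_failed_fix(3))
```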

From debugging sessions: