Test Refactor

Ask the user for the test file to refactor. Work on one test file at a time.

Process

Work through these steps in order, making a separate commit for each step that makes changes:

•Read the source file being tested
•Count decision points: if, else, for, while, case, catch, &&, ||, ?:
•Cyclomatic complexity = 1 + decision points
•This is your TARGET test count (approximately)
•Report: "Cyclomatic complexity: ~N paths, currently M tests"
•If M >> N, tests need consolidation

•First: identify what the PUBLIC API is (what external callers use)
•Functions exported only to enable testing are NOT public API - make them private
•Private methods tested indirectly through public interface
•If function A only calls function B internally, test through A, make B private
•Example: buildBotRequest calls buildToolsForAgent → test through buildBotRequest, unexport buildToolsForAgent

•Audit/logging callbacks → implementation detail, remove
•Caching behavior → implementation detail, remove
•Tests that verify Promise.all/parallel execution → testing the language, remove
•If behavior matters to users, keep; if it's internal wiring, remove

•Target: ~1 test per code path through the function
•Merge tests covering same code path with different assertions
•Example: "returns response" + "propagates nestedTools" + "propagates suggestedActions" → one test checking all output fields
•Example: "MCP returns error" + "MCP throws exception" → one test for error handling

•Tests testing implementation → refactor to test behavior
•Conditional logic in tests → split into separate tests
•Order-dependent tests → make independent
•toHaveBeenCalledWith on non-void functions → stub return value instead (Query Command Separation: queries are stubs, commands are mocks)
•expect(arr).toHaveLength(n) followed by element check → remove length check, let element assertion fail directly
•If test level seems wrong for component, ASK USER with recommendation

•
Present 3 options for unclear tests:
1. •Rename test to describe behavior
2. •Improve assertion messages
3. •Both
•Wait for user decision

•Acceptance tests: Use mock web servers (http), TestContainers (DB)
•Unit tests: Only mock external dependencies (filesystem, random, time, http, DB)
•Prefer test doubles over mocking frameworks
•Use real objects when possible

After EACH step:

•TDD: Write tests first
•Acceptance tests for behavior, unit tests for complex pure functions
•
Acceptance level depends on component:
- •Microservices: Event or HTTP level
- •UI: Route-level component rendering
•Test count ≈ cyclomatic complexity (one test per code path)

•If no refactoring needed, report "No test refactoring needed"
•Otherwise provide brief summary (e.g., "Reduced 15 → 8 tests to match cyclomatic complexity, removed implementation detail tests")

Use Arlo's Commit Notation (ACN):