Testing

Stack-adaptive testing patterns and practices. This skill defines testing principles and methodologies, not specific framework commands.

Design Principles

•Discover, don't assume: Detect what testing tools the project uses
•Project-first: Use the project's configured test commands
•Patterns over tools: Teach testing concepts that apply to any framework

Test Pyramid

code

         /\
        /E2E\        Few: Critical user journeys (5-10%)
       /------\
      / Integ  \     Some: Service boundaries (15-25%)
     /----------\
    /   Unit     \   Many: Business logic (65-80%)
   /--------------\

Test Structure

Arrange-Act-Assert (AAA)

Universal pattern for all test frameworks:

code

Arrange: Set up test data and dependencies
Act:     Execute the code under test
Assert:  Verify expected outcomes

Naming Convention

code

[unit_being_tested]_[scenario]_[expected_result]

Examples:
- createUser_withValidData_returnsUser
- calculateTotal_withEmptyCart_returnsZero
- login_withInvalidCredentials_throwsAuthError

Framework Detection

Step 1: Discover Test Configuration

Look for test configuration files in the project:

bash

# List potential test config files
ls -la *test*.config.* *jest*.* *vitest*.* pytest.ini setup.cfg pyproject.toml 2>/dev/null

# Check for test directories
ls -d tests/ test/ __tests__/ spec/ *_test/ 2>/dev/null

# Check package.json/pyproject.toml for test scripts
grep -A 5 '"test"' package.json 2>/dev/null
grep -A 5 '\[tool.pytest' pyproject.toml 2>/dev/null

Step 2: Use Project's Test Commands

Always prefer project-defined scripts:

bash

# Check for defined test scripts
grep -E '"test":|"test:' package.json 2>/dev/null    # JavaScript
grep -A 3 '\[scripts\]' pyproject.toml 2>/dev/null   # Python
cat Makefile 2>/dev/null | grep -E '^test:'          # Any language

Run using the project's configured command:

•If npm test is configured → use npm test
•If make test exists → use make test
•Otherwise, search for the standard command for the detected framework

Step 3: Research Framework Commands

If the test framework is unfamiliar, use WebSearch:

code

WebSearch: "[framework name] run tests command [year]"
WebFetch: [official docs] → "Extract test execution commands"

Unit Testing Principles

What to Unit Test

•Business logic and algorithms
•Edge cases and boundary conditions
•Error conditions and exception handling
•Input validation
•Pure functions (no side effects)

What NOT to Unit Test

•Framework/library code
•Simple getters/setters
•Implementation details (test behavior, not structure)
•Third-party integrations (use integration tests)

Mocking Guidelines

When to Mock:

•External services (APIs, databases)
•Time-dependent operations (dates, timers)
•Random number generation
•File system operations
•Network requests

When NOT to Mock:

•The code under test itself
•Simple utility functions
•Internal domain logic
•Anything that's fast and deterministic

Integration Testing

API Testing Pattern

code

1. Set up test database/state
2. Make HTTP request to endpoint
3. Assert response status code
4. Assert response body structure and values
5. Assert side effects (database state changes)
6. Clean up test data

Database Testing Strategies

Strategy	Description	Use When
Transaction rollback	Run test in transaction, rollback after	Fast isolation needed
Test database	Separate database for tests	Full integration testing
Seed and clean	Load fixtures, clean after	Realistic data scenarios
Containers	Ephemeral DB per test run	CI/CD environments

E2E Testing

Locator Priority

Best practices for finding elements (applies to most E2E frameworks):

•Accessibility roles - Most robust, works like users interact
•Labels - For form inputs
•Text content - For visible content
•Test IDs - Last resort, explicit for testing

Page Object Model

Organize E2E tests with page abstractions:

code

pages/
  login.page.[ext]
  dashboard.page.[ext]
  settings.page.[ext]

tests/
  login.spec.[ext]
  dashboard.spec.[ext]

E2E Best Practices

•Test critical user journeys, not everything
•Use realistic but consistent test data
•Avoid flaky selectors (prefer stable identifiers)
•Keep E2E test count low (pyramid principle)
•Run E2E tests in isolated environments

Test Data Management

Factories

Use factories for dynamic test data:

•Generate realistic data programmatically
•Override specific fields as needed
•Avoid hardcoded magic values in tests

Fixtures

Use fixtures for stable scenarios:

•Predefined user accounts for login tests
•Reference data sets that don't change
•Configuration presets for different scenarios

Coverage Guidelines

Target Thresholds

Metric	Suggested Target
Statements	80%
Branches	80%
Functions	80%
Lines	80%

Coverage Commands

Discover coverage commands from project configuration:

bash

# Check for coverage scripts
grep -E 'coverage|cov' package.json 2>/dev/null
grep -E 'coverage|cov' pyproject.toml 2>/dev/null
grep -E 'coverage|cov' Makefile 2>/dev/null

If no coverage script exists, search for the framework's coverage option:

code

WebSearch: "[test framework name] code coverage command"

Test Quality Checklist

• Tests are independent (no shared mutable state)
• Tests are deterministic (same result every run)
• Tests clean up after themselves
• Tests are fast (unit < 100ms typical)
• Test names describe what they verify
• Tests cover both success and error cases
• No hardcoded paths or environment-specific values

Debugging Failing Tests

Process

•Read the failure message - Understand expected vs actual
•Run test in isolation - Verify it's not dependent on other tests
•Add logging - Print intermediate values
•Check test setup - Verify preconditions are met
•Verify mocks - Ensure mocks return expected values
•Check for flakiness - Run multiple times

Flaky Test Indicators

•Test passes/fails inconsistently
•Test depends on timing or order
•Test uses real external services
•Test has race conditions in async code

Flaky Test Management

Flaky tests undermine CI reliability and developer trust. This section provides systematic approaches to detect, diagnose, and fix them.

Detecting Flaky Tests

Local Detection:

bash

# Run test multiple times to detect flakiness
for i in {1..10}; do npm test -- --testNamePattern="suspect test" && echo "Pass $i" || echo "FAIL $i"; done

# For pytest
for i in {1..10}; do pytest path/to/test.py::test_name -x && echo "Pass $i" || echo "FAIL $i"; done

CI Detection Patterns:

•Same test fails intermittently across different PRs
•Test passes on retry without code changes
•Test fails only on certain CI runners or times

Common Flaky Test Causes

Cause	Symptom	Solution
Timing dependencies	Fails with "timeout" or inconsistent timing	Use explicit waits, not sleeps
Shared state	Fails when run with other tests, passes alone	Isolate test data, use fixtures
Order dependency	Fails when test order changes	Make each test independent
External services	Network errors, rate limits	Mock external dependencies
Race conditions	Intermittent with async code	Proper async/await handling
Resource exhaustion	Fails late in test suite	Clean up resources, increase limits
Time-based logic	Fails at certain times (midnight, DST)	Mock time/date functions
Random data	Different results with random inputs	Seed random generators

Fixing Flaky Tests

Strategy 1: Replace Sleeps with Explicit Waits

code

# Bad: Fixed sleep (may be too short or wasteful)
sleep(2)
assert element.visible

# Good: Wait for condition with timeout
wait_for(lambda: element.visible, timeout=5)

Strategy 2: Isolate Test Data

code

# Bad: Tests share database state
def test_create_user():
    create_user("john")  # May conflict with other tests

# Good: Each test has isolated data
def test_create_user(unique_user_factory):
    user = unique_user_factory()  # Generates unique user per test

Strategy 3: Mock Time-Dependent Code

code

# Bad: Uses real time
def test_expiration():
    token = create_token(expires_in=60)
    time.sleep(61)
    assert token.is_expired()

# Good: Mocks time
def test_expiration(mock_time):
    token = create_token(expires_in=60)
    mock_time.advance(61)
    assert token.is_expired()

Strategy 4: Seed Random Generators

code

# Bad: Non-deterministic
def test_random_selection():
    result = pick_random_item(items)  # Different each run

# Good: Seeded for reproducibility
def test_random_selection():
    random.seed(42)  # Or use pytest-randomly with --randomly-seed
    result = pick_random_item(items)

Quarantine Strategy

When a flaky test can't be immediately fixed:

•Mark as flaky - Use framework-specific skip/xfail annotations
•Document the issue - Create a ticket with reproduction steps
•Exclude from blocking CI - Move to non-blocking test suite
•Set fix deadline - Quarantine should be temporary

code

# Example: Jest
test.skip('flaky: issue #123 - timing issue in CI', () => {...})

# Example: pytest
@pytest.mark.skip(reason="Flaky: issue #123 - race condition")
def test_flaky_feature(): ...

# Example: Track in separate CI job
# .github/workflows/ci.yml
- name: Run stable tests (blocking)
  run: npm test -- --testPathIgnorePatterns="flaky"
- name: Run flaky tests (non-blocking)
  run: npm test -- --testPathPattern="flaky" || true

CI Retry Strategies

Framework Options:

•Jest: jest --runInBand (sequential), jest-circus retry
•pytest: pytest-rerunfailures plugin
•CI: Built-in retry (GitHub Actions retry, GitLab retry:)

When to Use Retries:

•As temporary mitigation while fixing root cause
•For genuinely transient issues (network glitches)
•NOT as permanent solution for broken tests

Flaky Test Prevention

Practice	Benefit
Run tests in random order	Catches order dependencies early
Run in CI before merge	Catches environment differences
Use test containers	Consistent database/service state
Mock external services	Eliminates network flakiness
Avoid global state	Prevents test interference
Set CI timeouts appropriately	Catches slow/hung tests

Rules (L1 - Hard)

Critical for test reliability and safety.

•NEVER share mutable state between tests (causes flakiness)
•NEVER use hardcoded delays (use proper async waiting)
•ALWAYS write tests BEFORE fixing bugs (reproduce first)

Defaults (L2 - Soft)

Important for effective testing. Override with reasoning when appropriate.

•Discover the project's test framework before running tests
•Use the project's configured test commands
•Test edge cases and error conditions
•Test behavior, not implementation details
•Document reason when skipping tests

Guidelines (L3)

Recommendations for comprehensive test coverage.

•Consider using AAA pattern (Arrange-Act-Assert) for clarity
•Prefer factories for dynamic test data
•Consider coverage targets around 80%