Code Quality: Testing

Name: code-quality-testing
Rating: 88
Author: pkuppens

Tests are production code. Apply the same quality standards.

Structure

•Every test tests one behavior. A failing test should tell you exactly what broke without reading the test body.
•Test names are sentences: test_chunker_splits_on_sentence_boundary_not_word_boundary is better than test_chunker_2.
•Test docstrings: "As a user I want [objective], so I can [benefit]." for user-facing tests. For technical/utility tests, use full docstring with Args/Returns or a concise one-liner. See test-documentation.mdc.

•Prefer narrow, fast unit tests. Mock external deps (Ollama, ChromaDB, auth) at the boundary — not deep inside logic.
•Mark slow/external tests. Any test touching network, Docker, or Ollama gets the right marker (@pytest.mark.slow, @pytest.mark.ollama, etc.).
•Avoid duplication. If several tests set up the same fixture with minor variations, use @pytest.mark.parametrize.

•Mutation testing (mutmut) guidance: Assert specific outcomes. An assertion that passes when the condition is inverted doesn't test anything. Each test must assert a definite result, not just "no exception raised".
•Coverage is a floor, not a goal. 100% coverage with weak assertions is worse than 80% with assertions that catch mutations. Prioritize behavioral coverage over line coverage.