thesis-disagreement-report
When to use
- •You want to find where keyword gating is too permissive or too strict compared to LLM.
Inputs
- •days window.
- •keyword threshold for treating keyword_score as 'match'.
- •optional output path.
Workflow
- •Run:
python -m ops.cli quality thesis-disagreement-report --days 30 --keyword-threshold 0.4 --out /tmp/report.md. - •Review examples and adjust keyword rules or negative keywords accordingly.
Outputs
- •Markdown report listing example signal_ids for keyword FP/FN.
Guardrails
- •Disagreement does not imply keyword is wrong; validate against real outcomes whenever possible.
References
- •
references/reference.md - •
docs/QUALITY_OPS_ARCHITECTURE.md