Token Saver Context Compression
Use this skill to reduce token usage without MCP by running local Python scripts bundled with the skill. This package is self-contained and does not import project files outside this skill folder.
When to use
- •Context is large or expensive.
- •You need query-targeted compression.
- •You need evidence sufficiency checks before answering.
Commands
Run from project root, or keep absolute paths as shown.
- •Profile token usage:
bash
python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/profile_tokens.py" --file <path> --output-format auto
- •Compress context:
bash
python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/compress_context.py" --file <path> --mode baseline --output-format auto python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/compress_context.py" --file <path> --mode query_guided --query "<question>" --output-format auto python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/compress_context.py" --file <path> --mode evidence_aware --query "<question>" --min-similarity 0.4 --output-format auto
Optional external-payload adapters (for LangChain/LlamaIndex-style JSON):
bash
python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/compress_context.py" --json '<json-array>' --input-adapter langchain_json --mode query_guided --query "<question>" --output-format auto python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/run_skill_workflow.py" --json-file <payload.json> --input-adapter auto --mode evidence_aware --query "<question>" --output-format auto
- •Validate evidence:
bash
python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/validate_evidence.py" --file <path> --query "<question>" --min-similarity 0.4 --output-format json
- •Run all steps in one command:
bash
python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/run_skill_workflow.py" --file <path> --mode evidence_aware --query "<question>" --output-format auto --fail-on-insufficient-evidence
- •Run TOON-vs-JSON benchmark/guard checks:
bash
python "$CLAUDE_PROJECT_DIR/.claude/skills/token-saver-context-compression/scripts/benchmark_toon_vs_json.py"
Workflow policy
- •Profile first.
- •Prefer
query_guidedfor QA tasks. - •Use
evidence_awarefor correctness-sensitive tasks. - •Prefer
--output-format autoso TOON is chosen only when it is likely to reduce tokens. - •If evidence is insufficient, reduce compression aggressiveness or broaden retrieval.
Output contracts
- •Scripts emit JSON to stdout.
- •Scripts support
--output-format {json,toon,auto}. - •Scripts support
--json/--json-filewith--input-adapter {raw_json,langchain_json,llamaindex_json,auto}for framework payload normalization. - •In
auto, TOON is selected only when a uniform object-array shape crosses a row threshold. - •Mixed or irregular structures automatically stay in JSON.
- •
validate_evidence.pyexits1when insufficient evidence is detected. - •
run_skill_workflow.pycan fail on insufficient evidence when--fail-on-insufficient-evidenceis set. - •
benchmark_toon_vs_json.pyexits non-zero if guard thresholds fail.
Requirements
- •Python 3.10+
- •Optional:
tiktokenfor exact token counts (fallback counter works without it)