Citation Verifier
Generate citations/ref.bib and ensure every entry has a traceable verification record in citations/verified.jsonl.
When network access is restricted, prefer a “record now, verify later” workflow: keep URLs/titles consistent and leave a clear verification note.
Input
- •
papers/paper_notes.jsonl
Outputs
- •
citations/ref.bib - •
citations/verified.jsonl
Workflow (heuristic)
- •Collect
bibkey,title,url,year,authorsfrompapers/paper_notes.jsonl. - •Write/refresh
citations/ref.bib:- •Prefer arXiv-style fields when
arxiv_id/primary_categoryexist (eprint,archivePrefix,primaryClass).
- •Prefer arXiv-style fields when
- •Write one verification record per BibTeX entry to
citations/verified.jsonlwith at least:- •
bibkey,title,url,date
- •
- •If you cannot verify via network, record a clear
notesfield (e.g., “auto-generated; needs manual verification”) and/or request human confirmation depending on your policy.
Quality checklist
- • Every BibTeX entry has a corresponding
verified.jsonlrecord. - • No missing
url/date/titlein verification records.
Offline Mode
When network access is restricted, run in offline mode to produce auditable records now, then verify later.
- •Generate offline records:
verification_status: offline_generated - •Verify later (when network is available):
--verify-only
verification_status
- •
offline_generated: record was generated without network verification (needs later verification) - •
verified_online: URL/title verified successfully by the script - •
verify_failed: verification was attempted but failed (network error or title mismatch) - •
needs_manual_verification: missing/ambiguous fields (e.g., emptyurl/title)
Script
Quick Start
- •
python .codex/skills/citation-verifier/scripts/run.py --help - •Offline (record now, verify later):
python .codex/skills/citation-verifier/scripts/run.py --workspace <workspace_dir> --offline
All Options
- •
--offline: do not attempt network verification; writeverification_status=offline_generated - •
--verify-only: verify existingcitations/verified.jsonlrecords (does not rewrite BibTeX) - •
--verification-note <text>: stored incitations/verified.jsonlnotes
Examples
- •Generate BibTeX + offline verification records:
- •
python .codex/skills/citation-verifier/scripts/run.py --workspace <ws> --offline --verification-note "auto-generated; needs manual verification"
- •
- •Later, verify-only (when network is available):
- •
python .codex/skills/citation-verifier/scripts/run.py --workspace <ws> --verify-only
- •
Notes
- •Minimal requirement for every verification record:
url,date,title. - •The script sanitizes stray/unbalanced
{}in titles to keepbibtexparsing robust. - •The script escapes LaTeX special chars in text fields (
& % $ # _) and rewrites superscript patterns likeX^NorX$^N$asX\textsuperscript{N}to keep LaTeX builds stable. - •URLs are kept raw in BibTeX
urlfields (BibTeX styles wrap them with\url{...});@miscuseshowpublished=\url{...}. - •In offline mode, records are not truly verified; treat
offline_generatedas a to-do for human/network verification.
Troubleshooting
Common Issues
Issue: Missing bibkey / missing url in notes
Symptom:
- •
citations/ref.bibis missing entries, orverified.jsonlhas emptyurl/title.
Causes:
- •
papers/paper_notes.jsonllacksbibkey/urlfields.
Solutions:
- •Ensure each core paper note has a stable
bibkeyand a canonicalurl. - •Rerun citation generation after fixing notes.
Issue: verification_status=offline_generated
Symptom:
- •Records exist but are not truly verified.
Causes:
- •
--offlinewas used, or network verification was unavailable.
Solutions:
- •When network is available, run
--verify-onlyto upgrade records. - •Or manually verify and update
citations/verified.jsonlwith notes.
Recovery Checklist
- • Every BibTeX entry has a matching
citations/verified.jsonlrecord. - • Verification records include
url,date,title.