Document Reader Skill

Use this skill when the user provides (or references) local files and you need a reliable way to ingest them into the agent workflow:

•Unstructured docs: PDFs, images, scanned or handwritten notes (returned as base64 + mime)
•Structured docs: JSON, NDJSON, CSV/TSV (returned as text plus optional parsing)
•Plain text: TXT/MD/etc.

This avoids ad-hoc “write a quick Python function to read X” every time.

Prerequisites (Local)

bash

./scripts/local-test.sh document-reader 7078

•Reads are workspace-only by default. To read outside the repo, set allow_outside_workspace=true.
•Large files are blocked or truncated via max_bytes / max_chars.

Read a PDF or image as base64 (for downstream OCR/vision or archive):

json

{
  "path": "data/intake/scanned_note.jpg",
  "mode": "binary",
  "include_data_url": true
}

Read JSON (returns both text and parsed json when valid):

json

{
  "path": "data/sample_cases/prior_auth_baseline/pa_request.json",
  "mode": "text",
  "parse_structured": true
}

Read CSV (returns rows up to max_rows):

json

{
  "path": "data/input/patients.csv",
  "mode": "text",
  "max_rows": 200
}

•Call read_document for each referenced file path.
•For binaries (PDF/images), use the returned mime + data_url/base64 to drive downstream extraction.
•Produce a de-identified structured summary (never commit PHI or secrets).