PDF Processing

Name: pdf-processing
Rating: 65
Author: pcingola

When to use this skill

Use this skill when the task involves reading, extracting, or transforming content from PDF documents.

For standard extraction, run the bundled script:

code

python scripts/extract.py <file.pdf>

The script reads the PDF and outputs structured content with page numbers and detected tables.

Return extracted text with page numbers and any detected tables in a structured format.

If you encounter scanned PDFs or complex layouts, OCR processing may be required.