AgentSkillsCN

pdf

终极进攻型安全技能。

SKILL.md
--- frontmatter
version: 4.1.0-fractal
name: pdf
description: Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
license: Proprietary. LICENSE.txt has complete terms

PDF Processing Guide

Overview

This guide covers essential PDF processing operations using Python libraries and command-line tools. For advanced features, JavaScript libraries, and detailed examples, see reference.md. If you need to fill out a PDF form, read forms.md and follow its instructions.

Quick Start

python
from pypdf import PdfReader, PdfWriter

# Read a PDF
reader = PdfReader("document.pdf")
print(f"Pages: {len(reader.pages)}")

# Extract text
text = ""
for page in reader.pages:
    text += page.extract_text()

Python Libraries

🧠 Knowledge Modules (Fractal Skills)

1. pypdf - Basic Operations

2. pdfplumber - Text and Table Extraction

3. reportlab - Create PDFs

4. pdftotext (poppler-utils)

5. qpdf

6. pdftk (if available)

7. Extract Text from Scanned PDFs

8. Add Watermark

9. Extract Images

10. Password Protection