AgentSkillsCN

PDF Processing

在实施任何文本提取/解析逻辑之前调用此技能,以学习如何使用 LlamaParse 准确处理任何文档。需要 llama_cloud_services 包和 LLAMA_CLOUD_API_KEY 作为环境变量。

SKILL.md
--- frontmatter
name: PDF Processing
description: Invoke this skill BEFORE implementing any text extraction/parsing logic to learn how to use LlamaParse to process any document accurately. Requires llama_cloud_services package and LLAMA_CLOUD_API_KEY as an environment variable.

PDF Processing

Quick start

Extract text:

python
from llama_cloud_services import LlamaParse

parser = LlamaParse(
    parse_mode="parse_page_with_agent",
    model="openai-gpt-4-1-mini",
    high_res_ocr=True,
    adaptive_long_table=True,
    outlined_table_extraction=True,
    output_tables_as_HTML=True,
    result_type="markdown",
    project_id=project_id,
    organization_id=organization_id,
)

result = parser.parse("./my_file.pdf")
documents = result.get_markdown_documents(split_by_page=True)

full_text = ""

for document in documents:
    full_text += document.text + "\n\n---\n\n"

For more detailed code implementations, see REFERENCE.md.

Requirements

The llama_cloud_services package must be installed in your environment:

bash
pip install llama_cloud_services

And the LLAMA_CLOUD_API_KEY must be available as an environment variable:

bash
export LLAMA_CLOUD_API_KEY="..."