pdf processing

PDF Processing

Quick start

Extract text:

from llama_cloud_services import LlamaParse

parser = LlamaParse( parse_mode="parse_page_with_agent", model="openai-gpt-4-1-mini", high_res_ocr=True, adaptive_long_table=True, outlined_table_extraction=True, output_tables_as_HTML=True, result_type="markdown", project_id=project_id, organization_id=organization_id, )

result = parser.parse("./my_file.pdf") documents = result.get_markdown_documents(split_by_page=True)

full_text = ""

for document in documents: full_text += document.text + "\n\n---\n\n"

For more detailed code implementations, see REFERENCE.md.

Requirements

The llama_cloud_services package must be installed in your environment:

pip install llama_cloud_services

And the LLAMA_CLOUD_API_KEY must be available as an environment variable:

export LLAMA_CLOUD_API_KEY="..."

pdf processing

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

extract structured data from unstructured files (pdf

pdf-processing

pdf processing