Overview of Parse

Introduction to Parse, a document parsing service that transforms unstructured documents into LLM-ready data with support for various file types and output formats.

Get complex documents into LLM-readable formats

LlamaParse turns complex, messy documents into structured, LLM-ready content by combining OCR with customized parsing agents. Whether your sources are scanned PDFs, images, or native digital files, you get clean text, markdown, or JSON that fits straight into your models and pipelines.

Flexible Parsing - Choose between Cost Effective, Agentic, and Agentic Plus tiers to handle everything from simple text to visually complex documents.
Broad File Support - Parse PDFs, DOCX, PPTX, XLSX, HTML, JPEG, XML, EPUB, and many more.
Chart Parsing - Extract charts and visualizations into structured data so LLMs and downstream tools can reason over them.
Multimodal & Custom Output - Accurately extract tables, charts, images, and diagrams into structured formats. Use custom prompt instructions to tailor the output the way you want it.

Get started on the cloud

Sign up for LlamaCloud to create an account and get an API key. Then use the web UI, Python SDK, or REST API to start parsing.

Connect your documents - Upload or stream documents via our API, client libraries, or UI, with built-in connectors to sync with enterprise data sources.
Configure your parsing - Select a preset for a quick start, or define a custom configuration with specific models, output formats, and parsing instructions tailored to your use case.
Get clean, structured results - Receive parsed output in text, markdown, or JSON, ready to plug into your application, database, or LLM pipeline.
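
The configuration step above can be pictured with a small, self-contained sketch. It makes no network call and does not use the LlamaParse SDK. Every name in it (`ParseConfig`, `build_parse_request`, the tier and format strings, the payload keys) is a hypothetical stand-in chosen to mirror the tiers and output formats named in this overview, not the identifiers the real API accepts.

```python
from dataclasses import dataclass
from pathlib import Path

# Hypothetical values: they mirror the tiers and output formats named in
# this overview, not the identifiers the real service expects.
TIERS = {"cost_effective", "agentic", "agentic_plus"}
OUTPUT_FORMATS = {"text", "markdown", "json"}


@dataclass(frozen=True)
class ParseConfig:
    """Illustrative parsing configuration (assumed shape, not the SDK's)."""
    tier: str = "cost_effective"
    output_format: str = "markdown"
    instructions: str = ""  # optional custom prompt instructions

    def __post_init__(self):
        # Reject values this sketch does not know about.
        if self.tier not in TIERS:
            raise ValueError(f"unknown tier: {self.tier!r}")
        if self.output_format not in OUTPUT_FORMATS:
            raise ValueError(f"unknown output format: {self.output_format!r}")


def build_parse_request(path: str, config: ParseConfig) -> dict:
    """Describe a parse job for one document without sending anything."""
    request = {
        "file_name": Path(path).name,
        "tier": config.tier,
        "output_format": config.output_format,
    }
    # Only include instructions when the caller supplied some.
    if config.instructions:
        request["instructions"] = config.instructions
    return request


# Example: a visually complex report parsed to JSON with a custom instruction.
config = ParseConfig(tier="agentic", output_format="json",
                     instructions="Keep table headers.")
request = build_parse_request("reports/q3.pdf", config)
```

Validating the configuration before anything is uploaded keeps mistakes such as a misspelled tier from turning into a failed or mis-billed job. For the actual parameter names and upload calls, consult the SDK or REST API reference.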

High-quality document parsing is one of the most overlooked—yet crucial—steps in the LLM stack. Models can only reason with the information you give them, and most documents today are hard for LLMs to interpret out of the box.

LlamaParse was built to solve this problem from the ground up. Unlike generic OCR or PDF-to-text tools, LlamaParse uses AI-native methods to understand structure, layout, and intent—ensuring every output is optimized for downstream LLM consumption.