Command-Line Interface - Camelot 2.0.0 documentation
Command-Line Interface#
Camelot comes with a command-line interface.
To print the help for the interface, run camelot --help in your terminal; the output is shown below. To print the help for an individual command, run camelot <command> --help. Try it out!
Running without installing (uvx)#
If you only want to use the CLI ad-hoc, uvx lets you run it without installing Camelot into the current environment:
$ uvx camelot-py lattice --output tables.csv document.pdf
The camelot-py console script is an alias for camelot whose name matches the PyPI package name, which is why the short form above works. The older uvx --from camelot-py camelot … invocation also still works.
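When the CLI is driven from a script, it can help to assemble the argument list in one place. The following is a minimal sketch, not part of Camelot: build_camelot_command and its parameters are hypothetical names, and it only uses the flags that appear on this page (--format and --output).

```python
import shlex


def build_camelot_command(flavor, output, pdf, fmt=None, use_uvx=False):
    """Assemble an argument list for the CLI (hypothetical helper)."""
    # Prefix with uvx to run without installing into the current environment.
    command = ["uvx"] if use_uvx else []
    command += ["camelot-py", flavor]
    # --format is optional, so it is only added when given explicitly.
    if fmt is not None:
        command += ["--format", fmt]
    command += ["--output", output, pdf]
    return command


command = build_camelot_command("lattice", "tables.csv", "document.pdf", use_uvx=True)
print(shlex.join(command))
```

The returned list can be passed to subprocess.run(command, check=True); keeping it as a list avoids shell quoting problems with paths that contain spaces.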
Format inference#
--format is optional — when omitted, Camelot infers the format from the --output path’s extension. Supported extensions:
So this works:
$ camelot-py lattice --output tables.xlsx document.pdf
This is equivalent to:
$ camelot-py lattice --format excel --output tables.xlsx document.pdf
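The inference rule can be pictured as a lookup on the output path's extension. This is an illustrative sketch only, not Camelot's implementation: resolve_format and EXTENSION_TO_FORMAT are hypothetical names, and the table holds just two entries. The .xlsx entry comes from the example above, while the .csv entry is an assumption.

```python
from pathlib import Path

# ".xlsx" -> "excel" is taken from the example above.
# ".csv" -> "csv" is an assumption; the full list of supported
# extensions is not reproduced in this sketch.
EXTENSION_TO_FORMAT = {".xlsx": "excel", ".csv": "csv"}


def resolve_format(output, explicit_format=None):
    """Return the explicit format if given, else infer it from the extension."""
    if explicit_format is not None:
        return explicit_format
    suffix = Path(output).suffix
    if suffix not in EXTENSION_TO_FORMAT:
        raise ValueError(f"cannot infer a format from {output!r}; pass --format")
    return EXTENSION_TO_FORMAT[suffix]


print(resolve_format("tables.xlsx"))
```

An explicit --format always takes precedence in this sketch, which mirrors the description of the flag as optional rather than ignored.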
Output is a template#
--output is treated as a template: each detected table is written to <output_stem>-page-<P>-table-<T>.<ext>, where <P> is the page number and <T> is the table's number on that page. For example, --output report.csv on a document with 2 tables on page 1 and 1 table on page 3 produces report-page-1-table-1.csv, report-page-1-table-2.csv, and report-page-3-table-1.csv.
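To predict which files a run will create, the naming rule can be reproduced in a few lines. This is a sketch of the rule as described above, not code from Camelot: expected_output_paths and its tables_per_page argument (a mapping from page number to the number of tables on that page) are hypothetical.

```python
from pathlib import Path


def expected_output_paths(output, tables_per_page):
    """List the file names the template rule yields (hypothetical helper)."""
    template = Path(output)
    paths = []
    for page in sorted(tables_per_page):
        # Table numbers restart at 1 on every page, as in the example above.
        for table in range(1, tables_per_page[page] + 1):
            name = f"{template.stem}-page-{page}-table-{table}{template.suffix}"
            paths.append(str(template.with_name(name)))
    return paths


print(expected_output_paths("report.csv", {1: 2, 3: 1}))
# ['report-page-1-table-1.csv', 'report-page-1-table-2.csv', 'report-page-3-table-1.csv']
```

Note that no file named report.csv itself appears in the result, which is the practical consequence of --output being a template rather than a literal path.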
camelot#
Camelot: PDF Table Extraction for Humans.
Usage
camelot [OPTIONS] COMMAND [ARGS]...
Options
--version#
Show the version and exit.
Commands
hybrid
Combines the strengths of both the Network…
lattice
Use lines between text to parse the table.
network
Use text alignments to parse the table.
stream
Use spaces between text to parse the table.