This project converts PDF documents into clean CSV spreadsheets. It's designed to help you extract tables and data from PDFs easily, so you can use them in Excel, Google Sheets, databases, or for further analysis.
You can customize the tool's behavior using the following command-line flags:
| Flag | Type | Default Value | Description |
|---|---|---|---|
--data |
string | assets/ |
Specifies the directory containing the data files (e.g., PDF documents) to be extracted. |
--output |
string | output/ |
Specifies the directory where the extracted CSV files (or other output) will be saved. |
# Extract from a custom data directory and save to a custom output directory
make build
./extractor --data /path/to/my/pdfs --output /my/output/folder
# Use default data and output directories
make run