A command-line utility to extract text from PDF files using the Poppler library. It is significantly faster than pdftotext
, for which it serves as a minimal drop-in replacement (albeit with fewer options).
- GCC
- Poppler as well as its glib bindings (
poppler-glib
)
To install the program, run:
sudo make install
This will install the executable to /usr/local/bin
(or the path specified in the Makefile).
To extract text from a PDF file, run:
pdf2txt [--stdout] <path_to_pdf>
By default, the program will create a text file with the same name as the PDF file in the current directory. If you pass the --stdout
option, the extracted text will be printed to the standard output instead.
This project is released under the GNU General Public License v3.0.