/pdf2txt-converter

A pdf-to-txt-converter that converts multiple files in a specified directory.

Primary LanguagePython

PDF-TXT-converter

Small and simple pdf-to-txt converter that uses the pdfminer package.

Requirements

Python (3.6 or above)
PDFMiner

How to use:

1. Clone the repository

git clone https://github.com/victoriamok/pdf2txt-converter.git

2. Create a virtual environment (for MacOS or Linux)

python3 -m venv <name_of_virtualenv>

To activate your virtual environment, run

source <name_of_virtualenv>/bin/activate

3. Install the required packages

cd <name_of_virtualenv>
pip3 install -r requirements.txt

4. Run the script

python3 pdf2txt.py /your/path/to/pdf/files/ /your/path/to/txt/files/