This Python script allows you to extract images from each page of a PDF file and save them in a specified folder with a given extension.
Before running the script, install the required Python libraries using the provided requirements.txt file:
pip install -r requirements.txt
To use the script, run it from the command line and specify the input PDF file, the output folder, and the desired image extension. For example, this command extracts images from input.pdf and saves them as JPEG files in output_folder:
python main.py -i input.pdf -o output_folder -e .jpeg
-h, --help: Display help information showing all command-line options.
-i, --input: Specify the PDF file to process. (required)
-o, --output: Specify the folder where the extracted images will be saved. (required)
-e, --extension: Specify the image file extension (e.g., .jpeg, .png). (default: .png)
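
For readers who want to see how such options fit together, the following is a minimal sketch of how they could be declared with Python's standard argparse module. It is not the script's actual source: the function name build_parser and the help strings are assumptions made for illustration, and only the flags, the required markers, and the .png default come from the list above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented options (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Extract images from each page of a PDF file.",
    )
    # -h/--help is added automatically by argparse.
    parser.add_argument(
        "-i", "--input", required=True,
        help="PDF file to process",
    )
    parser.add_argument(
        "-o", "--output", required=True,
        help="folder where the extracted images will be saved",
    )
    parser.add_argument(
        "-e", "--extension", default=".png",
        help="image file extension, e.g. .jpeg or .png (default: .png)",
    )
    return parser


# Parse the example command from this README by passing the arguments explicitly.
args = build_parser().parse_args(
    ["-i", "input.pdf", "-o", "output_folder", "-e", ".jpeg"]
)
print(args.input, args.output, args.extension)
```

Passing an explicit list to parse_args keeps the sketch runnable on its own; a real script would normally call parse_args() with no arguments so that the values come from the command line.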