larxiver is a tiny script to compress latex projects by only including used files and optionally downscaling images. It keeps the original directory structure and is intended to ease the publishing process. Archives generated with this script can be uploaded to arxiv directly.
./larxiver.py -i /path/to/paper.tex -o /path/to/output/arxiv
- -i: Input latex file
- -o: Output zip file
- -r: Target DPI, (optional, default 0, copies without rescaling)
- -k: Keep temporary files (optional)
- -v: Verbose output (optional)
- -vv: Very verbose output (optional)
- pdflatex
- python3
- pillow
larxiver analyzes the log file of pdflatex and records all input files and the used size of image files. Input files are copied to a temporary output directory, which is later zipped; and images are scaled to produce a given DPI in this process.
larxiver is useful because it keeps your latex structure. If this is not required and the goal is to compress a PDF file, you could use Ghostscript instead:
# If your pdf contains uncompressed images, say PNGs, this will convert them to JPG and reduce the file size significantly
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dDownsampleColorImages=false -dNOPAUSE -dBATCH -sOutputFile=paper_web.pdf paper.pdf
# If you want to adjust the resolution too
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dDownsampleColorImages=true -dColorImageResolution=150 -dNOPAUSE -dBATCH -sOutputFile=paper_web.pdf paper.pdf
Note, however, that larxiver can produce better results in some cases as it is not converting images to JPG and therefore not introducing compression artifacts.