Command-line ArXiv & CVF Open Access Paper Downloader. [PyPI] [Source]
Disclaimer: This is a highly opinionated CLI tool for downloading papers. It prioritizes ease of use for researchers. Obviously, this is not an official project.
- Download papers from ArXiv, CVPR, ICCV, and WACV via a simple CLI.
- Speed up downloads by using aria2c.
- Retrieve the paper's metadata such as:
  - Title, Abstract, Year
  - Authors
  - Comments (Conference acceptance info)
  - Repository URLs
  - BibTeX Citation
- Automatically maintain a list of local papers and their metadata in a JSON file.
- Configure the desired download destination via an environment variable or a command-line argument.
- All downloaded papers will have a standardized filename for easy browsing.
- Save time and effort when downloading and organizing papers on your machine.
- Speed up the download process by using multiple parallel connections.
- The local paper list is handy for quick local lookup, making notes, and doing citations.
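As a rough illustration of how a local paper list in a JSON file could be kept up to date, here is a minimal sketch. The function name `upsert_paper`, the `papers.json` filename, and the record keys (`paper_id`, `title`, `year`, `authors`) are assumptions made for this example only; they are not the tool's actual schema.

```python
import json
import tempfile
from pathlib import Path


def upsert_paper(list_path: Path, record: dict) -> list:
    """Add or update one record in a JSON list file, keyed by 'paper_id'.

    Hypothetical helper: the real tool's file layout may differ.
    """
    if list_path.exists():
        papers = json.loads(list_path.read_text(encoding="utf-8"))
    else:
        papers = []
    # Drop any older record for the same paper, then append the new one.
    papers = [p for p in papers if p.get("paper_id") != record["paper_id"]]
    papers.append(record)
    # Keep the list in a stable order so the file stays easy to browse.
    papers.sort(key=lambda p: p.get("paper_id", ""))
    list_path.write_text(
        json.dumps(papers, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return papers


# Demo in a temporary directory so nothing is written to the real home folder.
with tempfile.TemporaryDirectory() as tmp_dir:
    demo_path = Path(tmp_dir) / "papers.json"  # assumed filename
    upsert_paper(
        demo_path,
        {
            "paper_id": "1512.03385",
            "title": "Deep Residual Learning for Image Recognition",
            "year": 2015,
            "authors": ["Kaiming He", "Xiangyu Zhang", "Shaoqing Ren", "Jian Sun"],
        },
    )
    print(demo_path.read_text(encoding="utf-8"))
```

Keying on the paper ID means downloading the same paper twice updates its entry instead of duplicating it.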
This is a command-line tool. Use pip to install the package globally.
- Pre-requisite: Python 3.x
python3 -m pip install --upgrade arxiv-dl
(Optional) Install aria2c for download speedup.
- MacOS: brew install aria2
- Linux: sudo snap install aria2c
After installation, the command getpaper should be available in your terminal.
$ getpaper [-h] [-v] [-p] [-d DOWNLOAD_DIR] [-n N_THREADS] urls [urls ...]
Options:
- -v, --verbose (optional): Print paper metadata.
- -pb, --progbar (optional): Show progress bar.
- -p, --pdf_only (optional): Download PDF only without creating Markdown notes.
- -d, --download_dir (optional): Specify one-time download directory. This option will override the default download directory or the one specified in the environment variable ARXIV_DOWNLOAD_FOLDER.
- -n, --n_threads (optional): Specify the number of parallel connections to be used by aria2.
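For readers who want to see how the options above fit together, the following is a minimal argparse sketch that mirrors the documented flags. It is an illustration, not the tool's source: the `build_parser` name, the help strings, and the `None` defaults are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that mirrors the documented getpaper options (sketch only)."""
    parser = argparse.ArgumentParser(prog="getpaper")
    # One or more ArXiv IDs or paper URLs.
    parser.add_argument("urls", nargs="+", help="ArXiv IDs or paper URLs")
    parser.add_argument("-v", "--verbose", action="store_true",
                        help="print paper metadata")
    parser.add_argument("-pb", "--progbar", action="store_true",
                        help="show progress bar")
    parser.add_argument("-p", "--pdf_only", action="store_true",
                        help="download PDF only, without Markdown notes")
    # Defaults of None are an assumption; the real defaults are not documented here.
    parser.add_argument("-d", "--download_dir", default=None,
                        help="one-time download directory")
    parser.add_argument("-n", "--n_threads", type=int, default=None,
                        help="number of parallel connections for aria2")
    return parser


args = build_parser().parse_args(["-v", "-n", "8", "1512.03385", "2103.15538"])
print(args.urls, args.verbose, args.n_threads)
```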
# Use ArXiv Paper ID
$ getpaper 1512.03385 2103.15538
# Use ArXiv Abstract Page URL
$ getpaper https://arxiv.org/abs/2103.15538
# Use ArXiv PDF Page URL
$ getpaper https://arxiv.org/pdf/1512.03385.pdf
# Use CVF Open Access URL
$ getpaper "https://openaccess.thecvf.com/content/CVPR2021/html/Lin_Real-Time_High-Resolution_Background_Matting_CVPR_2021_paper.html"
- Default Download Destination: ~/Downloads/ArXiv_Papers
- To set a custom download destination, use the environment variable ARXIV_DOWNLOAD_FOLDER. Include the following line in your .bashrc or .zshrc file:
export ARXIV_DOWNLOAD_FOLDER=~/Documents/Papers
- Precedence:
  - Command-line option -d
  - Environment variable ARXIV_DOWNLOAD_FOLDER
  - Default download destination
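The precedence order above can be sketched as a small resolver. This is a hedged illustration of the documented rule, not the tool's implementation; the function name `resolve_download_dir` and its `env` parameter are hypothetical.

```python
import os
from pathlib import Path
from typing import Mapping, Optional

DEFAULT_DIR = "~/Downloads/ArXiv_Papers"  # documented default destination
ENV_VAR = "ARXIV_DOWNLOAD_FOLDER"         # documented environment variable


def resolve_download_dir(
    cli_dir: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> Path:
    """Pick the download directory: -d option, then env variable, then default."""
    env = os.environ if env is None else env
    # The first non-empty value wins, in the documented precedence order.
    chosen = cli_dir or env.get(ENV_VAR) or DEFAULT_DIR
    # Expand a leading "~" so the result is a usable filesystem path.
    return Path(os.path.expanduser(chosen))


print(resolve_download_dir("/tmp/papers", {ENV_VAR: "/data/papers"}))
print(resolve_download_dir(None, {ENV_VAR: "/data/papers"}))
```

Passing the environment as a parameter keeps the function easy to test without touching the real shell environment.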
- You can always set your own preferred alias for the default getpaper command.
- Include the following line(s) in your .bashrc or .zshrc file to set your preferred alias:
alias dp="getpaper"
alias dpv="getpaper -v -d ~/Documents/Papers"
python3 -m venv venv && \
source venv/bin/activate && \
pip install -e ".[dev]"
pytest
make
make clean
- Add support for aria2c.
- Add support for papers on CVF Open Access.
- Add support for papers on OpenReview.
MIT License - Copyright (c) 2021-2022 Mark Huang
