zhanghuimeng/arxiv-crawler

Python

arXiv Page Crawler

Crawls a set of information from an arXiv.org article.

Basic Information
Abstract Translation
Articles Download

Installation

pip3 install beautifulsoup4
pip3 install request
pip3 install pyyaml

Put your baidu translation API key in key.yaml:

APP_ID: "my app id"
KEY: "my key"

Usage

python crawler.py --link [link]
python crawler.py --file [file of links]
python crawler.py --file [file of links] --download