
Download image source files from the arXiv bulk data.


Arxiv Image Crawler

It crawls arXiv images starting from a given cutoff date. The arXiv bulk data is served through the AWS S3 service, so you need to register an AWS account first.

To run the code, first enter your AWS access key and secret access key in the shell script, then run:

bash crawl_arxiv.sh
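If you would rather not keep the keys in the script itself, one option is to export them in your shell and read them from Python. The sketch below assumes the standard AWS environment variable names `AWS_ACCESS_KEY_ID` and `AWS_SECRET_ACCESS_KEY`; the helper `load_aws_credentials` is hypothetical and is not part of this repository.

```python
import os


def load_aws_credentials(env=None):
    """Return (access_key, secret_key) read from environment variables.

    Hypothetical helper, not part of this repository. `env` defaults to
    os.environ; a plain dict can be passed in for testing.
    """
    env = os.environ if env is None else env
    names = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")
    # Collect every variable that is unset or empty so the error names them all.
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("Missing AWS credentials: " + ", ".join(missing))
    return env[names[0]], env[names[1]]
```

Failing early with a clear message avoids starting a long crawl that would only stop at the first S3 request.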

To modify the cutoff date, edit crawl.py.
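As a rough illustration of what a cutoff filter can look like, here is a minimal sketch. It is not the code in crawl.py. It assumes archive names of the form `arXiv_src_YYMM_NNN.tar` and that the cutoff keeps archives dated on or after the given month; `CUTOFF`, `archive_month`, and `filter_by_cutoff` are hypothetical names.

```python
import re
from datetime import date

# Hypothetical cutoff value; the real setting lives in crawl.py.
CUTOFF = date(2020, 1, 1)

# Assumed archive naming: arXiv_src_YYMM_NNN.tar
_ARCHIVE_RE = re.compile(r"arXiv_src_(\d{2})(\d{2})_\d+\.tar$")


def archive_month(key):
    """Return the first day of the month encoded in an archive name, or None."""
    match = _ARCHIVE_RE.search(key)
    if match is None:
        return None
    yy, mm = int(match.group(1)), int(match.group(2))
    if not 1 <= mm <= 12:
        return None
    # Two-digit years: arXiv began in 1991, so 91-99 map to the 1990s.
    year = 1900 + yy if yy >= 91 else 2000 + yy
    return date(year, mm, 1)


def filter_by_cutoff(keys, cutoff=CUTOFF):
    """Keep archive names whose month is on or after the cutoff (assumed direction)."""
    kept = []
    for key in keys:
        month = archive_month(key)
        # Names that do not look like source archives are skipped.
        if month is not None and month >= cutoff:
            kept.append(key)
    return kept
```

Archive names only carry year and month, so this comparison works at month granularity: a cutoff in the middle of a month excludes that month's archives.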