PDFSCRAPE is a script written in python which allows you to automatically download(scrape) pdfs from http://ndl.ethernet.et.edu
- Easy to use
- Fast download
- Scrapes pdfs only from selected department
- Continue your downloads if download is interrupted.
-
Beautiful soup
-
tqdm
clone this repo
gitclone https://github.com/L0rdC0mm4nd3r/pdfscrape
Install requirements
pip3 install -r requirements.txt
You can start PDFSCRAPE by
python3 pdfscrape.py
OR
chmod a+x pdfscrape.py && ./pdfscrape.py
This project is licensed under the GNU General Public License v3.0 License - see the LICENSE file for details
- Hat tip to anyone whose code was used