/PDFSCRAPE

PDFSCRAPE is a script written in Python which will allows you to automatically download pdfs from http://ndl.ethernet.et.edu

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

pdfscrape

PDFSCRAPE

PDFSCRAPE is a script written in python which allows you to automatically download(scrape) pdfs from http://ndl.ethernet.et.edu

Features

  • Easy to use
  • Fast download
  • Scrapes pdfs only from selected department
  • Continue your downloads if download is interrupted.

Requirements

  • Beautiful soup

  • tqdm

Installation

clone this repo

gitclone https://github.com/L0rdC0mm4nd3r/pdfscrape

Install requirements

pip3 install -r requirements.txt

Usage

You can start PDFSCRAPE by

python3 pdfscrape.py

OR

chmod a+x pdfscrape.py && ./pdfscrape.py

GIF

Author

License

This project is licensed under the GNU General Public License v3.0 License - see the LICENSE file for details

Acknowledgments

  • Hat tip to anyone whose code was used