This python script automates the JSTOR download process for books with many individual chapters, for example, https://www.jstor.org/stable/10.1525/j.ctv1xxxq7
In addition to download all individual chapters, it will also merge the chapters into a single pdf file.
An IP address that has access to the JSTOR is required.
python3 -m pip install -r requirements.txt
- Modify the
book_url
and theparent_directory
variables in the script. - Run the script.
- JSTOR is very aggressive in preventing any form of batch download.
- You might have to redo the captcha during the chapter download process. If the captcha is not finished with in
time_for_recaptcha
, the download process will fail, you will have to restart the script.