
A simple Python scraper for the gallica.bnf.fr website (output is high-resolution JPEG)


BNF_Gallica_Scraper

Say you have an old e-reader that doesn't support PDF scans and you still want to read old books from the awesome https://gallica.bnf.fr archive. Well, you just use this script.

Plz no abuse! The French gov't has been nice enough to set this thing up, so let's be thankful and not be di*ks.

🇫🇷🇫🇷🇫🇷 French version of the Doc: https://nazmi.fr/gallica_bnf_scraper/

Video tutorial coming soon

How To

Download bnfscrape.py

Put it in an empty folder

Have Python and pip installed, plus the dependency library: wget

pip install wget

Edit the 3 variables at the top of the script

fraum = first page to download (download from this page)

tau = last page to download (download to this page)

part1 = beginning of the image URL, see the example

(To get the URL, right-click on a page of the document in the booklet viewer, then click "Open image in new tab" / "View image".)
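
To make the role of the three variables more concrete, here is a rough sketch of how they could drive a download loop. This is not the actual content of bnfscrape.py. It assumes each page URL is simply part1, followed by the page number, followed by an optional ending (the `suffix` parameter is hypothetical, as are the function names, the output file names, the pause between requests and the example.org URL). Check the URL you copied from the viewer to see what really comes before and after the page number.

```python
def build_page_urls(part1, fraum, tau, suffix=""):
    """Return one URL per page, from page `fraum` to page `tau` inclusive.

    Assumption: a page URL is part1 + page number + suffix.
    """
    if fraum > tau:
        raise ValueError("fraum must not be greater than tau")
    return [f"{part1}{page}{suffix}" for page in range(fraum, tau + 1)]


def download_pages(part1, fraum, tau, suffix="", pause_seconds=2.0):
    """Download every page into the current folder, one JPEG per page."""
    import time
    import wget  # third-party dependency: pip install wget

    urls = build_page_urls(part1, fraum, tau, suffix)
    for page, url in enumerate(urls, start=fraum):
        # Hypothetical naming scheme: page_0001.jpg, page_0002.jpg, ...
        wget.download(url, out=f"page_{page:04d}.jpg")
        # Small pause between requests, to go easy on the server.
        time.sleep(pause_seconds)


if __name__ == "__main__":
    # Placeholder URL, only to show the shape of the result (no download).
    for url in build_page_urls("https://example.org/book/f", 1, 3, ".jpg"):
        print(url)
```

Running the file as-is only prints the URLs it would fetch. Calling download_pages with your own part1, fraum and tau would fetch the pages for real, so keep the page range reasonable.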