Spider for book titles from Worldcat.

Worldcat Scraper

Collects book titles from Worldcat's search result pages. See this example of a scrape that retrieves the titles of all Dutch fiction books released in 2019.

Note: Worldcat search only lets you page through the first 5000 hits, so a query that matches more than 5000 records yields only a partial list of titles.
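
To make the limit concrete, here is a minimal sketch of how a crawl could work out which result pages are reachable. It is not taken from spider.py. The page size of 10 and the 1-based start offsets are assumptions for illustration, and the helper names are hypothetical. Only the 5000-hit cap comes from the note above.

```python
MAX_HITS = 5000  # cap stated in the note above


def reachable_offsets(total_hits, page_size=10, max_hits=MAX_HITS):
    """Return the start offsets of the result pages that can be requested.

    Assumes (hypothetically) 1-based offsets and a fixed page size.
    """
    reachable = min(total_hits, max_hits)
    return list(range(1, reachable + 1, page_size))


def is_truncated(total_hits, max_hits=MAX_HITS):
    """True when the query matches more hits than the search will serve."""
    return total_hits > max_hits
```

If is_truncated reports True for a query, splitting the query into narrower ones (for example one per publication year) is one way to stay under the cap.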

Usage

scrapy runspider spider.py

Or if you want to save the output, e.g. as CSV:

scrapy runspider spider.py --output=res.csv -t csv
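
Once res.csv exists, the titles can be post-processed with the standard library. The sketch below assumes the exported CSV has a header row with a column named title. That column name is a guess, as are the helper names, so check the header of your own output and adjust the column argument if it differs.

```python
import csv


def unique_titles(rows, column="title"):
    """Return distinct, non-empty values of `column` in first-seen order.

    `rows` is any iterable of CSV lines, e.g. an open file object.
    The default column name "title" is an assumption about the export.
    """
    seen = set()
    titles = []
    for row in csv.DictReader(rows):
        title = (row.get(column) or "").strip()
        if title and title not in seen:
            seen.add(title)
            titles.append(title)
    return titles


def load_titles(path, column="title"):
    """Read a CSV file written by the spider and return its unique titles."""
    with open(path, newline="", encoding="utf-8") as handle:
        return unique_titles(handle, column)
```

For example, load_titles("res.csv") returns the de-duplicated titles as a list of strings.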