/ksp

Scrapers for Klimatschutzprogramme

Primary LanguagePythonMIT LicenseMIT

Scraping Klimaschutzprogramme

Various scrapers for regional climate action plans ("Klimaschutzprogramme").

Sources

Running

# list scrapers
$ docker-compose run scrapy list

# run a specific crawler and save the output in the data directory 
$ docker-compose run scrapy crawl eea -o /data/eea-participants.jl

# view data using jq
$ head -n1 data/eea-participants.jl | jq .
{
  "name": "Aachen",
  "postcode": "52064",
  "state": "Nordrhein-Westfalen",
  "eea-status": "Gold-zertifizierte Stadt / Gemeinde",
  "homepage": "http://www.aachen.de",
  "eea-url": "https://www.european-energy-award.de/kommunen/liste-der-eea-kommunen/details/eea/aachen/"
}

Caching is enabled and set to cache forever. Use the -s HTTPCACHE_ENABLED=False flag to override, or change it in the settings