Datenschule/jedeschule-scraper

update scrapers

Closed this issue · 1 comments

update the scrapers for all federal states

  • Nordrhein-Westfalen
  • Niedersachsen
  • Bayern
  • Rheinland-Pfalz
  • Hessen
  • Saarland
  • Berlin (Pending PR: #46 #81)
  • Brandenburg (Pending PR: #45)
  • Schleswig-Holstein
  • Mecklenburg-Vorpommern (#58)
  • Thüringen
  • Sachsen (#52)
  • Sachsen-Anhalt (#53)
  • Bremen
  • Baden-Württemberg
  • Hamburg

@cursorius-cursor has updated and rewritten the MV scraper: cursorius-cursor@9bb489b

Apart from fixing the scraper, the biggest change is that almost all keys/abbreviations are directly read from the Excel file (except for those that are missing from the legend).

However, the above version produces two different keys for the school type (space vs. newline):
row['Schulart/ Org.form']
and
row['Schulart/\nOrg.form']