
A Scrapy crawler for cidadao.reclameaqui that extracts information from complaint submission pages.

Primary language: Python


Sample complaint submission page
Fields scraped:

  • Title
  • Complaint (paragraph)
  • Category/Service
  • Location (Servico Publico)
  • Date
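
As a rough illustration of the fields listed above, one scraped record could be held in a small container like the one below. This is only a sketch: it uses a plain dataclass rather than the project's actual Scrapy item, and the class name, field names, and sample values are all hypothetical placeholders.

```python
from dataclasses import dataclass, asdict
from datetime import date


@dataclass
class Complaint:
    """One scraped complaint. Field names are assumptions, not the project's."""
    title: str
    complaint: str    # the complaint paragraph
    category: str     # category / service
    location: str     # the "Servico Publico" value shown on the page
    posted_on: date   # date of the complaint


# Placeholder values only; not real scraped data.
example = Complaint(
    title="Example title",
    complaint="Example complaint text.",
    category="Example category",
    location="Example public service",
    posted_on=date(2016, 5, 17),
)

# Convert to a plain dict, e.g. before serializing to a file.
record = asdict(example)
```

Keeping the date as a `date` object rather than a string makes it straightforward to group records by year and month later.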

Each complaint was saved in its own file. Files are organized by year and month, and each is named after a randomly generated three-part Japanese female name.
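
The storage layout described above might be sketched as follows. The name pool, the underscore separator, the `.txt` extension, and the function names `complaint_path` and `save_complaint` are all assumptions made for illustration; the project's real name list and file format are not shown here.

```python
import random
from datetime import date
from pathlib import Path

# Hypothetical name pool; the project's actual list is not shown in the source.
NAMES = ["Akiko", "Haruka", "Keiko", "Mariko", "Sakura", "Yumi"]


def complaint_path(root: Path, posted_on: date, rng: random.Random) -> Path:
    """Build <root>/<year>/<month>/<three names>.txt for one complaint."""
    # Pick three distinct names and join them into a single file stem.
    name = "_".join(rng.sample(NAMES, 3))
    return root / f"{posted_on.year:04d}" / f"{posted_on.month:02d}" / f"{name}.txt"


def save_complaint(root: Path, posted_on: date, text: str, rng: random.Random) -> Path:
    """Write one complaint to its own file, creating year/month folders as needed."""
    path = complaint_path(root, posted_on, rng)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(text, encoding="utf-8")
    return path
```

For example, `save_complaint(Path("complaints"), date(2016, 5, 17), "text", random.Random())` would write a file under `complaints/2016/05/`. With a small name pool, two complaints in the same month could receive the same name and overwrite each other, so a real implementation would need a larger pool or a collision check.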