public-law/open-gov-crawlers

Data publishing

Closed this issue · 4 comments

Try different methods of publishing the scraping results (the JSON data). E.g.:

  • Kaggle.
  • A GitHub repo, dedicated to this purpose.
  • Zyte's thing, whatever it's called.

After trying each of the above, I've chosen the second option: a dedicated repo for publishing data

https://github.com/public-law/datasets

  • Choose a license
  • Write the readme with a link to the Rome Statute English.
  • Add a Dublin Core Metadata object to the Rome Statute dataset: #93

Kaggle has stopped fixing and further developing the web app. There are bugs and half-finished changes from years ago that haven't been completed, particularly relating to Organizations.

Byte's public dataset sharing seems to no longer be supported and is difficult to use.

I think that a dedicated GitHub repo is the way to go. Maybe a Jekyll frontend.

I'm setting up a GitHub repo, datasets, for publishing the data.