What is the (simplest) way to maintain datasets in github.com/datasets?
rufuspollock opened this issue · 1 comments
rufuspollock commented
Quite a few of the core datasets are not getting updated. This raises the questions how we maintain the core datasets e.g.
- What patterns and tooling for creating and running data pipelines
- orchestration / running of that e.g. github actions or something else
- how we log errors
- how we (socially) know what datasets need updating and when (i.e. their frequency)
We could try this out re work of getting core datasets up to date #376
sabas commented
I try to maintain my main dataset (UN/LOCODE) when I remember, perhaps it could be useful to have a way of monitoring when a new release is needed by scraping the page from which we grab the dataset?
On datahub I see my dataset is stuck to 3 years ago, why?