bank-green/bankgreen-django

Ingest Wikidata data sources

Create Wikidata data sources from a SPARQL query

Wikidata maintains an excellent dataset of banks worldwide, queryable with SPARQL through its public API.

Update the python manage.py refresh_datasources command so that it creates datasources from the Wikidata API.

You can find a similar process and SPARQL query at
https://github.com/bank-green/banks/tree/main/sources/wikidata
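
As a starting point, here is a minimal sketch of fetching banks from the public Wikidata SPARQL endpoint using only the standard library. It is not the query from the linked repository. The query text, the User-Agent string, and the function names build_request and fetch_bindings are assumptions for illustration. The query filters out entities that have a dissolution date (P576), which is one possible way to drop defunct banks.

```python
import json
import urllib.parse
import urllib.request

WIKIDATA_ENDPOINT = "https://query.wikidata.org/sparql"

# Illustrative query (an assumption, not the project's query).
BANK_QUERY = """
SELECT ?bank ?bankLabel ?countryLabel ?parent WHERE {
  ?bank wdt:P31/wdt:P279* wd:Q22687 .          # instance of (a subclass of) bank
  OPTIONAL { ?bank wdt:P17 ?country . }         # country
  OPTIONAL { ?bank wdt:P749 ?parent . }         # parent organization
  FILTER NOT EXISTS { ?bank wdt:P576 ?gone . }  # skip entities with a dissolution date
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
"""


def build_request(query, endpoint=WIKIDATA_ENDPOINT):
    """Build a GET request that asks the endpoint for JSON results."""
    params = urllib.parse.urlencode({"query": query, "format": "json"})
    return urllib.request.Request(
        f"{endpoint}?{params}",
        headers={
            # Hypothetical identifier; Wikimedia asks clients to send a descriptive one.
            "User-Agent": "bankgreen-datasource-sketch/0.1 (example)",
            "Accept": "application/sparql-results+json",
        },
    )


def fetch_bindings(query=BANK_QUERY):
    """Run the query and return the list of result bindings."""
    with urllib.request.urlopen(build_request(query), timeout=60) as response:
        payload = json.load(response)
    return payload["results"]["bindings"]
```

A broad subclass traversal like this may hit the endpoint's time limit, so the real query may need to be narrowed or paged.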

Notes:

  • This involves both creating and updating Wikidata datasources.
  • The SPARQL query needs updating: it currently also captures poorly encoded buildings and defunct banks.
  • Wikidata sometimes provides subsidiary information. When it does, record it as 100% ownership in subsidiary_of_1_pct.
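
The notes above could be sketched roughly as follows. This is an illustration under assumptions, not the project's implementation: a plain dict stands in for the Django datasource model, the field names source_id, name, and country are hypothetical, and the input is assumed to be one binding in the standard SPARQL JSON results format with bank, bankLabel, countryLabel, and parent variables. The entity IDs in the sample are placeholders, not real Wikidata items.

```python
def qid(uri):
    """Extract the trailing entity ID from a Wikidata entity URI."""
    return uri.rsplit("/", 1)[-1]


def binding_to_fields(binding):
    """Map one SPARQL JSON binding to datasource fields (names are assumptions)."""
    fields = {
        "source_id": qid(binding["bank"]["value"]),
        "name": binding.get("bankLabel", {}).get("value", ""),
        "country": binding.get("countryLabel", {}).get("value", ""),
        "subsidiary_of_1": None,
        "subsidiary_of_1_pct": None,
    }
    parent = binding.get("parent")
    if parent:
        # Wikidata gives no ownership share, so a known parent is recorded as 100%.
        fields["subsidiary_of_1"] = qid(parent["value"])
        fields["subsidiary_of_1_pct"] = 100
    return fields


def upsert(store, fields):
    """Create or update the record keyed by source_id; return True if created."""
    created = fields["source_id"] not in store
    store.setdefault(fields["source_id"], {}).update(fields)
    return created


# Placeholder binding with made-up IDs, for illustration only.
sample = {
    "bank": {"type": "uri", "value": "http://www.wikidata.org/entity/Q00000001"},
    "bankLabel": {"type": "literal", "value": "Example Bank"},
    "parent": {"type": "uri", "value": "http://www.wikidata.org/entity/Q00000002"},
}

store = {}
print(upsert(store, binding_to_fields(sample)))  # first call creates the record
print(upsert(store, binding_to_fields(sample)))  # second call updates it
```

In the Django command, the upsert step would more likely use something like the ORM's update_or_create keyed on the Wikidata ID, so that rerunning refresh_datasources updates existing rows instead of duplicating them.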