/GaNCH-data

Using Linked Open Data for Georgia's Natural, Cultural, and Historic Organizations' Disaster Response

Primary LanguagePythonGNU Lesser General Public License v3.0LGPL-3.0

GaNCH

GaNCH: Using Linked Open Data for Georgia's Natural, Cultural, and Historic Organizations' Disaster Response.

https://ganch.auctr.edu

The Atlanta University Center Robert W. Woodruff Library would like to gratefully acknolwedge LYRASIS for their support of this project via a 2019 LYRASIS Catalyst Fund grant

This one-year project will create a publicly editable directory of Georgia’s Natural, Cultural and Historical Organizations (NCHs), allowing for quick retrieval of location and contact information for disaster response. Directory information will be compiled, updated, and uploaded to Wikidata, the linked open data database from the Wikimedia Foundation. Directory information will then be delivered via a website, allowing emergency responders to quickly search for NCHs in disaster areas.

("GaNCH" rhymes with "ranch")

Data

  • Data Dictionary - Mapping metadata fields to Wikidata's schema
  • Data Sources - Where we're getting the directory information
  • Index - Index of all the organizations we've created/edited in Wikidata
  • "Instance of" taxonomy - Taxonomy of all the P31 "Instance of" organization types that we are including, and their relevant subclasses. This helps us construct the SPARQL queries we use, since we can say "find me all cultural institutions or anything that's a subclass of cultural institution" to save time and energy.
  • Municipalities - A spreadsheet of Georgia's municipalities and all their counties, mapping the problem where about 10% of Georgia's municipalities belong to more that one county (see: Addressing Challenges).
  • Schema - The OpenRefine schema that we use to reconcile against Wikidata's data model before uploading.
  • Template - The CSV template that we make all the source datasets use.
  • SPARQL_GEMA - Examples of SPARQL queries for the GEMA Regions, showing how we use VALUES lists to 1) combine multiple counties together to create the regions, and 2) combine multiple "instance of"s together to get our specific results

Documentation

Setup

Partners

  • Project Partners List - The organizations that we're partnering with in Georgia to supply directory information, including a point of contact at each organization

Workflow

  • The Workflow Manual provides a step-by-step process for accomplishing the tasks of the project to help others replicate or adapt our process for their own region.
  • Addressing Challenges describes and documents some of the challenges we encountered during the project, and how we are addressing them.