callahantiff/PheKnowLator

Wiki: Update v2.0.0 data source hyperlinks

Closed this issue · 6 comments

Task: Make sure that all data source links are updated on the Wiki and throughout repo for current build.

Update: Upload all data referenced in Wiki and used in current build to Zenodo and associate it with the the GitHub release.

Make sure that links are also updated in each releases Jupyter Notebooks:

  • main.ipynb
  • Ontology_Cleaning.ipynb
  • Data_Preparation.ipynb

Also, replace dropbox links with links to zenodo used when downloading the data sources:

  • resources/resource_info.txt
  • resources/ontology_source_list.txt
  • resources/edge_source_list.txt

TASK

Task Type: PKT DATA DELIVERY

Use Google Cloud Storage (GCS) bucket with dated subdirectories to store data and output for each build. This would let us have automatic updated build information and provide the following things:

  • Source data used to build the KG
  • Built KG output

TODO

  • Connect to GCS storage and update release Wikis to point to relevant builds (related issue #70)
  • Look into archival back-up storage options as well

Container Storage
Docker container for each KG build type (does not contain the source data -- this is retrieved from the GCS bucket) stored on DockerHub or Google's Container registry

Done! All wiki pages have had DropBox links removed and now point the GCS buckets.