/encode3-curation

Various work in progress files working on trying to curate our ENCODE data submissions.

Primary LanguageJupyter Notebook

encode3-curation

This is my collection of IPython notebooks to document some of the work I've done to try and curate the Wold Lab's data submissions to the ENCODE-DCC.

Originally I just started sharing it so I could show my work to our super data wrangler.

However I then started writing some notebooks that might be considered useful documentation.

Git Annex

There were a few pdfs that I wanted to track with in this repository, but partially because some were still embargoed and because committing binary files to git has issues I decided to use git-annex to track them.

Homepage: git-annex

git-annex is available in Debian, Ubuntu, and OS X Homebrew.

to download the pdfs you would do:

git clone https://github.com/detrout/encode3-curation.git
cd encode3-curiation
git annex init
git annex get <pdf name>