/riboseqorg_metadata

Ribo-seq metadata standardization for RiboCrypt and Riboseq.org

Primary LanguageR

Ribo-seq metadata standardization for RiboCrypt and Riboseq.org

The script does 3 things:

  1. Given a Entrez fetch table ~ 700 columns from SRA (not included in the scripts)
  2. Standardize column names (CELL_LINE, CELL LINE, celllines are all the same)
  3. Standardize column values: (Ribo-seq, Riboseq, RIBOSEQ are all the same)
  4. Semi manual annotation (HeLA is female cell line, HEK is male etc)

Finally upload this file to google drive with statistics of how much could be standardized.

About

The procedure is packaged into 3 scripts ran from: metadata_main_script.R