sra_metadata as standalone function that creates `metadata.yaml` sheet

Question

Closed this issue 3 years ago · 2 comments

See #41 for some context.

We want to split out sra_metadata.py into a standalone Python script that queries SRA and writes a metadata.yaml sheet for qcdb.db_load to parse and load. It will take two inputs: the path to a data folder and the output path where metadata.yaml will be written. The data folder should contain either SRR ids, as generated by sra-tools/fastq-dump, or SRX_SRS ids, as generated by bioflows.
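
As a rough illustration of the input handling described above, here is a minimal sketch of the command-line interface and the data-folder scan. It is not the actual sra_metadata.py: the function names `find_sra_ids` and `parse_args`, the argument names, and the assumption that each id appears at the start of a file or directory name are all hypothetical. The SRA query itself is left out.

```python
import argparse
import re
from pathlib import Path

# Assumption: ids appear at the start of each file or directory name,
# e.g. "SRR1234567_1.fastq" or "SRX111_SRS222".
SRR_RE = re.compile(r"^(SRR\d+)")
PAIR_RE = re.compile(r"^(SRX\d+)_(SRS\d+)")


def find_sra_ids(data_dir):
    """Return (srr_ids, srx_srs_pairs) found in the entry names of data_dir."""
    srr_ids, pairs = set(), set()
    for entry in Path(data_dir).iterdir():
        pair_match = PAIR_RE.match(entry.name)
        if pair_match:
            pairs.add((pair_match.group(1), pair_match.group(2)))
            continue
        srr_match = SRR_RE.match(entry.name)
        if srr_match:
            srr_ids.add(srr_match.group(1))
    return sorted(srr_ids), sorted(pairs)


def parse_args(argv=None):
    """Parse the two inputs: the data folder and the output location."""
    parser = argparse.ArgumentParser(
        description="Sketch: collect SRA ids from a data folder."
    )
    parser.add_argument("data_dir", help="folder named by SRR or SRX_SRS ids")
    parser.add_argument("output_path", help="where metadata.yaml will be written")
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    srr_ids, pairs = find_sra_ids(args.data_dir)
    print(f"Found {len(srr_ids)} SRR ids and {len(pairs)} SRX_SRS pairs")
```

Using a set before sorting means paired-end files such as `SRR1234567_1.fastq` and `SRR1234567_2.fastq` are reported as a single id.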

Answer 1 · 2022-05-02T19:26:40.000Z

For a sheet based on data from SRA, db_id should be filled in as an SRS_SRX id, and the sheet should also have the fields sample_id: SRS and experiment_id: SRX.
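
To make the field layout concrete, here is a small sketch that builds one entry with those three fields and renders it as YAML. The helper names `make_entry` and `to_yaml` are hypothetical, and the list-of-mappings layout is an assumption, since the thread does not show the actual structure of metadata.yaml. The YAML is written by hand only to keep the example free of third-party dependencies.

```python
def make_entry(sample_id, experiment_id):
    """Build one sheet entry; db_id joins the ids in SRS_SRX order."""
    return {
        "db_id": f"{sample_id}_{experiment_id}",
        "sample_id": sample_id,
        "experiment_id": experiment_id,
    }


def to_yaml(entries):
    """Render flat entries as a YAML list of mappings.

    Assumption: all values are plain accession strings that need no quoting.
    """
    lines = []
    for entry in entries:
        for i, (key, value) in enumerate(entry.items()):
            prefix = "- " if i == 0 else "  "
            lines.append(f"{prefix}{key}: {value}")
    return "".join(line + "\n" for line in lines)


if __name__ == "__main__":
    # Placeholder accessions, for illustration only.
    print(to_yaml([make_entry("SRS0000001", "SRX0000001")]), end="")
```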

Answer 2 · 2022-08-02T02:47:30.000Z

All set and pushed to GitHub