Datasets as a worked example
Closed this issue · 2 comments
happykhan commented
Create some docs, use these Datasets as a worked example:
- Failed QC
- VOI/VOC lineages
happykhan commented
Fetching datasets
conda create -n datasets-sars-cov-2 -c conda-forge -c bioconda uscdc-datasets-sars-cov-2
conda activate datasets-sars-cov-2
export NCBI_API_KEY="<your-NCBI-API-key-here>"
GenFSGopher.pl --numcpus 8 --compressed --outdir vocvoi-dataset ~/miniconda3/envs/datasets-sars-cov-2/share/uscdc-datasets-sars-cov-2/sars-cov-2-voivoc.tsv
GenFSGopher.pl --numcpus 8 --compressed --outdir failedQC-dataset ~/miniconda3/envs/datasets-sars-cov-2/share/uscdc-datasets-sars-cov-2/sars-cov-2-failedQC.tsv
happykhan commented
Data sets available on zenodo.
Test data is available here: https://zenodo.org/record/7018405