- Introduction
- DNA-Seq Analysis
- RNA-Seq Analysis
- ARGO QC Tools and Workflows
- Mutational Signatures Analysis
- ARGO Reference Files
- ARGO Utility
- This gives an overview of major ICGC ARGO analysis pipelines which are running in production or with development work in progress. The content will be periodically reviewed and changed over time.
- Current ARGO production workflows are running in Regional Data Processing Centre (RDPC).
- Each ARGO workflow in the repositories has been tested on accompany test datasets.
- All ARGO analysis workflows are written in Nextflow with Dockerized tools.
- All ARGO workflows are published under an GNU AGPL v3 open-source license and are packaged for community usage
- Please check ICGC ARGO Documentation for more details.
- Workflow git repo: https://github.com/icgc-argo-workflows/dna-seq-processing-wfs
- This repository maintains the Nextflow workflow code for BWA-MEM alignment, which can be used for Whole Exome Sequencing (WXS), Whole Genome Sequencing (WGS), Targeted Sequencing, and some other DNA-Seq experimental strategies. The workflow takes either BAM or FASTQ files as inputs, performs reads mapping for each read group, and optional step of MarkDuplicates, then outputs a merged and sorted CRAM file, a CRAM index file, and various QC metrics files.
- Workflow git repo: https://github.com/icgc-argo-workflows/sanger-wgs-variant-calling
- This repository maintains the Nextflow workflow code for Sanger WGS somatic variant calling. The underlying containerized variant calling tools include CaVEMan, Pindel, ASCAT and BRASS, which were originated from https://github.com/cancerit/dockstore-cgpwgs.
- Workflow git repo: https://github.com/icgc-argo-workflows/sanger-wxs-variant-calling
- This repository maintains the Nextflow workflow code for Sanger WXS somatic variant calling. The underlying containerized variant calling tools include CaVEMan and Pindle, which were originated from https://github.com/cancerit/dockstore-cgpwxs.
- Workflow git repo: https://github.com/icgc-argo-workflows/gatk-mutect2-variant-calling
- This repository maintains the Nextflow workflow code for Mutect2 somatic variant calling. The underlying variant calling and related tools are originated from GATK https://github.com/broadinstitute/gatk.
- Workflow git repo: https://github.com/icgc-argo-workflows/open-access-variant-filtering
- This repository maintains the source code of the ICGC ARGO Open-access Somatic Variant Filtering Workflow. It is a bioinformatics workflow that can be used to filter SNV/InDel VCFs based on predefined genomic regions.
- WIP git repo: https://github.com/icgc-argo-workflows/argo-somatic-variant-calling
- This repository contains a collection of somatic SNV/InDel variant callers used for various analysis in the context of the ICGC ARGO project.
- WIP git repo: https://github.com/icgc-argo-workflows/icgc-argo-sv-copy-number
- This repository contains a collection of somatic SV/CNV callers used for various analysis in the context of the ICGC ARGO project.
- Workflow git repo: https://github.com/icgc-argo-workflows/rna-seq-alignment
- This repository maintains the Nextflow workflow code for STAR and HiSAT2 RNA-Seq alignment workflows that take either BAM and FASTQ files as inputs, and generates genome aligned CRAM, Splice Junction Quantifications, and various QC metrics files.
- WIP git repo: https://github.com/icgc-argo-workflows/rna-seq-expression-counting
- This repository contains the packages for RNA-Seq expression-counting/normalization analysis that are part of the ICGC ARGO data analysis pipeline.
- WIP git repo: https://github.com/icgc-argo-workflows/rna-editing
- This repository collects the tool packages used for RNA Editing analysis in the context of the ICGC ARGO project.
- WIP git repo: https://github.com/icgc-argo-workflows/alternative-promoter-analysis
- This repository collects the tool packages used for Alternative Promoter analysis in the context of the ICGC ARGO project.
- WIP git repo: https://github.com/icgc-argo-workflows/allele-specific-expression
- This repository collects analysis package used for allele specific expression(ASE) in the context of the ICGC ARGO project
- WIP git repo: https://github.com/icgc-argo-workflows/rna-seq-splicing
- This repository contains the packages for RNA-Seq splicing analysis that are part of the ICGC ARGO data analysis pipeline.
- WIP git repo: https://github.com/icgc-argo-workflows/rna-seq-data-simulation
- This repository contains code for the generation of simulated RNA-Seq reads that can be used for the benchmarking of different parts of the ICGC ARGO RNA-Seq analysis pipeline.
- WIP git repo: https://github.com/icgc-argo-workflows/argo-qc-tools
- This repository contains a collection of data QC tools/modules used in various ICGC ARGO workflows
- WIP git repo: https://github.com/icgc-argo-workflows/icgc-argo-mutational-signatures
- This repository contains a collection of analysis packages that are part of the ICGC ARGO mutational signatures pipeline.
- WIP git repo: https://github.com/icgc-argo-workflows/argo-reference-files
- This repository contains urls and metadata for all reference files used by the ICGC-ARGO data analysis pipelines
- Metadata payload generation tools: https://github.com/icgc-argo-workflows/data-processing-utility-tools
- Song/Score download/upload tools: https://github.com/icgc-argo-workflows/nextflow-dna-seq-processing-tools
- Quality control/benchmarking metrics collecting tools: https://github.com/icgc-argo-workflows/data-qc-tools-and-wfs
- DNA-Seq data processing tools: https://github.com/icgc-argo-workflows/dna-seq-processing-tools
- Genomic variant calling tools: https://github.com/icgc-argo-workflows/variant-calling-tools
- Dockerized GATK tools: https://github.com/icgc-argo-workflows/gatk-tools
- Benchmark datasets: https://github.com/icgc-argo-workflows/metadata-for-benchmarking-datasets
- Workflow migration tool: https://github.com/icgc-argo-workflows/workflow-repo-migration