/ICGC-featureCounts

WIP: A Nextflow pipeline to run featureCounts on RNAseq BAM files on ICGC in AWS/AWS Batch

Primary LanguageNextflowMIT LicenseMIT

ICGC-FeatureCounts

WIP: This is still in development, but should be relatively stable by now.

This pipeline uses featureCounts on cancer datasets from ICGC and generates count matrices, similar to what nf-core/RNAseq does. Users can specify a ICGC Manifest file with object ids, which will then be converted to encrypted S3 URLs. The pipeline then uses the provided GTF file to generate count matrices for all files in the manifest.

Build Status Nextflow install with bioconda Docker Automated build https://www.singularity-hub.org/static/img/hosted-singularity--hub-%23e32929.svg

Introduction

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker / singularity containers making installation trivial and results highly reproducible.

Documentation

The ICGC-FeatureCounts pipeline comes with documentation about the pipeline, found in the docs/ directory:

  1. Installation
  2. Pipeline configuration
  3. Running the pipeline
  4. Output and how to interpret the results
  5. Troubleshooting

Credits

This pipeline was written by Alexander Peltzer (apeltzer) at QBiC.