/rnaseq

RNA sequencing analysis pipeline using STAR or HISAT2, with gene counts and quality control

Primary LanguageHTMLMIT LicenseMIT

nfcore/rnaseq

Build Status Nextflow DOI Gitter

Introduction

UCL-BLIC/rnaseq is a bioinformatics analysis pipeline used for RNA sequencing data, modified to add kallisto.

The workflow processes raw data from FastQ inputs (FastQC, Trim Galore!), aligns the reads (STAR or HiSAT2), generates gene counts (featureCounts, StringTie) as well as kallisto abundance files, and performs extensive quality-control on the results (RSeQC, dupRadar, Preseq, edgeR, MultiQC). See the output documentation for more details of the results.

The pipeline is built using Nextflow, a bioinformatics workflow tool to run tasks across multiple compute infrastructures in a very portable manner.

Documentation

The UCL-BLIC/rnaseq pipeline comes with documentation about the pipeline, found in the docs/ directory:

  1. Installation
  2. Running the pipeline
  3. Output and how to interpret the results
  4. Troubleshooting

Credits

These scripts were originally written for use at the National Genomics Infrastructure, part of SciLifeLab in Stockholm, Sweden, by Phil Ewels (@ewels) and Rickard Hammarén (@Hammarn).

Many thanks to other who have helped out along the way too, including (but not limited to): @Galithil, @pditommaso, @orzechoj, @apeltzer, @colindaven.