An educational tutorial and working demonstration pipeline for RNA-seq analysis including an introduction to: cloud computing, next generation sequence file formats, reference genomes, gene annotation, expression analysis, differential expression analysis, alternative splicing analysis, data visualization, and interpretation.
This repository is used to store code and certain raw materials for a detailed RNA-seq tutorial. To actually complete this tutorial, go to the RNA-seq tutorial wiki.
Citation: Malachi Griffith*, Jason R. Walker, Nicholas C. Spies, Benjamin J. Ainscough, Obi L. Griffith*. 2015. Informatics for RNA-seq: A web resource for analysis on the cloud. PLoS Comp Biol. 11(8):e1004393.
*To whom correspondence should be addressed: E-mail: mgriffit[AT]genome.wustl.edu, ogriffit[AT]genome.wustl.edu
Note: An archived version of this tutorial exists here. This version is maintained for consistency with the published materials (Griffith et al. 2015. PLoS Comp Biol.) and for past students wishing to review covered material. However, we strongly suggest that you continue with the current version of the tutorial below.
Fork it and send a pull request.
- Module 0 - Introduction to NIH HPC Biowulf and Unix
- Module 1 - Introduction to RNA sequencing
- Module 2 - RNA-seq Alignment and Visualization
- Module 3 - Expression and Differential Expression
- Expression
- Differential Expression
- DE Visualization
- Kallisto for Reference-Free Abundance Estimation
- Module 4 - Isoform Discovery and Alternative Expression
- Reference Guided Transcript Assembly
- de novo Transcript Assembly
- Transcript Assembly Merge
- Differential Splicing
- Splicing Visualization
- Module 5 - De novo transcript reconstruction
- Module 6 - Functional Annotation of Transcripts
- Appendix