This topic will cover:
- how to use shotgun metagenomic sequencing technologies to answer biological questions
- the methodologies used to perform metagenomic studies on the taxonomy, function, evolution, and ecology of microorganisms
- the intricacies of the specific tools and databases that can be used for varying levels of specificity
Lesson 1: What is metagenomics?
- [Presentation Slides] and [Presentation Video] by Eric Collins
- What kinds of questions can metagenomics be used to answer?
- Is metagenomics right for me?
- Yay! You got data! Now what?
- [Video Tutorial] by Alexis Marshall
- [Bash script]
- Programs used:
- fastqc
- multiqc
Lesson 2: Taxonomic Classification using k-mers
- [Presentation Slides] and [Presentation Video] by Eric Collins
- The power of k-mers
- Choosing a k-mer classifier
- [Video Tutorial] by Eric Collins
- [Bash script]
- Programs used:
- kraken2
- bbduk
- krona
- [Video Tutorial] by Ella Sieradzki
- [Bash script]
- Programs used:
- fastqc
- multiqc
- trimmomatic
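To make the idea behind "The power of k-mers" concrete, here is a minimal Python sketch of k-mer based classification. It is a toy under stated assumptions: the function names (`kmers`, `classify`), the reference sequences, and the choice of `k=4` are all hypothetical and chosen for illustration. Real classifiers such as kraken2 rely on far more compact indexes and taxonomy-aware scoring, so this shows only the core intuition of matching shared k-mers.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings (k-mers) of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def classify(read, references, k=4):
    """Toy classifier: pick the reference that shares the most k-mers with the read.

    `references` maps a (hypothetical) taxon name to a reference sequence.
    Returns the best label and the per-reference shared k-mer counts.
    """
    read_kmers = kmers(read, k)
    scores = {name: len(read_kmers & kmers(ref, k))
              for name, ref in references.items()}
    best = max(scores, key=scores.get)
    label = best if scores[best] > 0 else "unclassified"
    return label, scores


# Made-up reference sequences, for illustration only.
references = {
    "taxon_A": "ATGCGTACGTTAGC",
    "taxon_B": "TTTTGGGGCCCCAAAA",
}

label, scores = classify("GCGTACGT", references)
print(label, scores)
```

In this example every 4-mer of the read occurs in `taxon_A` and none occurs in `taxon_B`, so the read is assigned to `taxon_A`.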
Lesson 3: Taxonomic Classification using MinHash sketches
- [Presentation Slides] and [Presentation Video] by Eric Collins
- The Jaccard Index
- How does MinHash work?
- How do Bloom filters work?
- Choosing a MinHash sketcher
- [Video Tutorial] by Eric Collins
- [Bash script]
- Programs used:
- sourmash
- sendsketch
- bbduk
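The Jaccard index and MinHash topics listed for this lesson can be illustrated with a short Python sketch. It assumes a "bottom-s" style sketch (keep the `size` smallest hash values of the k-mer set) and uses SHA-1 from the standard library as a stand-in hash function. The names `jaccard`, `sketch`, and `minhash_jaccard` are hypothetical, and this is not the implementation used by sourmash or sendsketch.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Exact Jaccard index: |A intersect B| / |A union B|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (SHA-1 is an arbitrary choice here)."""
    return int.from_bytes(hashlib.sha1(kmer.encode()).digest()[:8], "big")


def sketch(kmer_set, size):
    """Bottom-s sketch: the `size` smallest hash values of the set."""
    return set(sorted(hash_kmer(x) for x in kmer_set)[:size])


def minhash_jaccard(sketch_a, sketch_b, size):
    """Estimate the Jaccard index from two bottom-s sketches.

    Take the `size` smallest hashes of the merged sketches and count
    how many of them are present in both sketches.
    """
    merged = sorted(sketch_a | sketch_b)[:size]
    if not merged:
        return 0.0
    shared = sum(1 for h in merged if h in sketch_a and h in sketch_b)
    return shared / len(merged)


a = kmers("ATGCGTACGTTAGCATCGATCGGA", 5)
b = kmers("ATGCGTACGTTAGCTTCGATCGGA", 5)
exact = jaccard(a, b)
estimate = minhash_jaccard(sketch(a, 8), sketch(b, 8), 8)
print(exact, estimate)
```

With a small sketch the estimate is only approximate. When the sketch size is at least the number of distinct k-mers, the estimate coincides with the exact value, which is a convenient sanity check.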
Lesson 4: Read mapping and read taxonomic classification
- [Presentation Slides] and [Presentation Video] by Luis Valentin and Ella Sieradzki
- How does mapping work?
- Choosing a metagenomic read mapper
- The Burrows-Wheeler transform and using bowtie2
- [Video Tutorial] by Ella Sieradzki
- [Bash script]
- Programs used:
- bbmap
- bbwrap
- bbsplit
- [Video Tutorial] by Ella Sieradzki
- Programs used:
- Kaiju
- Centrifuge
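As a companion to the Burrows-Wheeler transform topic in this lesson, the following Python sketch builds the transform naively by sorting all rotations of the text, then inverts it. This is for intuition only: the function names are hypothetical, the `$` end marker is an assumed convention, and read mappers such as bowtie2 use much more efficient index structures than this quadratic approach.

```python
def bwt(text, sentinel="$"):
    """Burrows-Wheeler transform via sorted rotations (naive, for small inputs)."""
    if sentinel in text:
        raise ValueError("text must not contain the sentinel character")
    s = text + sentinel
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    # The transform is the last column of the sorted rotation matrix.
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(last_column, sentinel="$"):
    """Recover the original text by repeatedly prepending and re-sorting."""
    n = len(last_column)
    table = [""] * n
    for _ in range(n):
        table = sorted(last_column[i] + table[i] for i in range(n))
    # The row ending in the sentinel is the original text plus the sentinel.
    original = next(row for row in table if row.endswith(sentinel))
    return original[:-1]


transformed = bwt("banana")
print(transformed)               # annb$aa
print(inverse_bwt(transformed))  # banana
```

The transform tends to group identical characters together, which is what makes it compressible and searchable.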
Lesson 5: Genome and Metagenome Assembly
- [Presentation Slides]
- Sequencing Statistics
- Overlap-Layout-Consensus assembly
- de Bruijn Graph assembly
- Assembly software
- [Bash script]
- Programs used:
- bbduk
- tadpole
- spades
- statswrapper
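The de Bruijn graph assembly topic above can be sketched in a few lines of Python. This minimal example, assuming error-free reads and a hypothetical helper named `de_bruijn`, only builds the graph: each distinct k-mer becomes an edge from its (k-1)-length prefix to its (k-1)-length suffix. Assemblers such as spades add error correction, graph simplification, and path finding on top of this idea.

```python
from collections import defaultdict


def de_bruijn(reads, k):
    """Build a de Bruijn graph from reads.

    Nodes are (k-1)-mers; each distinct k-mer contributes one edge
    from its prefix to its suffix. Returns {node: set of successor nodes}.
    """
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return dict(graph)


# Two made-up overlapping reads, for illustration only.
graph = de_bruijn(["ACGT", "CGTA"], k=3)
for node, successors in sorted(graph.items()):
    print(node, "->", sorted(successors))
```

Following the edges from `AC` through `CG` and `GT` to `TA` spells out `ACGTA`, the sequence that both reads overlap to form.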
- How do you determine the quality of an assembly?
- Choosing an assembly quality metric
- Demo using QUAST, etc.
- What is an assembly graph useful for?
- Demo using Bandage
- How does metagenomic binning work?
- Choosing a metagenomic binner
- Demo using metabat2, CONCOCT, etc.
- What are single-copy core genes?
- Choosing a bin evaluation tool
- Demo using CheckM, etc.
- What are single-copy core genes?
- Basics of sequence alignment
- Basics of phylogenetic tree construction
- Demo using GToTree, GTDB-Tk, etc.
- A review of existing workflows
- Why you probably shouldn’t invent your own workflow
- What to do when you inevitably decide to invent your own workflow anyway
- Demo using Snakemake, Nextflow, etc.
- Anvi’o
- MGnify
- MG-RAST
- Galaxy
- JGI
- CyVerse
- KBase
- etc.