This course will provide a thorough introduction to the application of metabarcoding techniques in microbial ecology.
The topics covered by the course range from bioinformatic processing of next-generation sequencing data to the most important approaches in multivariate statistics. Using a combination of theoretical lectures and hands-on exercises, the participants will learn the most critical computational steps of a metabarcoding study from the processing of raw sequencing reads down to the final statistical evaluations. After completing the course, the participants should be able to understand the potential and limitations of metabarcoding techniques as well as to process their own datasets to answer the questions under investigation.
FORMAT
This course is designed for researchers and students with strong interests in applying novel high-throughput DNA sequencing technologies to answer questions in the area of community ecology and biodiversity.
The course will mainly focus on the analysis of phylogenetic markers to study bacterial, archaeal and fungal assemblages in the environment, but the theoretical concepts and computational procedures can be equally applied to any taxonomic group or gene of interest.
ASSUMED BACKGROUND
The participants should have some basic background in biology and understand the central role of DNA for biodiversity studies. No programming or scripting expertise is required but completing the preparation materials to be comfortable with linux basics (i.e. making directories & copying files) is important.
!! See the Unix short tutorial if you need a reminder/refresher.
All the hands-on exercises will be carried out using QIIME2 platform . No previous knowledge of computer science is required but a basic knowledge of “bash” would allow to focus more on the microbial analysis. LEARNING OUTCOMES
- Understanding the concept, potential and limitation of microbial metabarcoding techniques.
- Learning how to process raw sequencing reads to obtain meaningful information.
- Obtaining experience on how to statistically evaluate and visualize your data.
- Being able to make informed decisions on best practices for your own data.
❗ ❗ ❗ All the papers discussed during the course are available in this Google Drive Folder ❗ ❗ ❗
Physalia Welcome [Course participants introduction]
- Introduction to the course and Instructors:
- Presentation - Brief introduction about Metagenomics concepts. What are we going to learn?
- Practical - Connecting to the server via SSH
- Presentation - Sequence data in the FASTQ format and Quality Filtering
- Practical - First hands on data (& linux review)
- Presentation- Experimental design part 1
- Presentation - Qiime2 & Qiime objects
- Practical - Let's begin! Atacama Soil Experiment and Loading data in Qiime
- Presentation - Concepts on Negative and Positive controls
- Presentation - Key concept: OTU or ASV;
- Different Sequencing Technologies and different denoising approaches
- Practical - Denoising and DADA2
- Presentation - Experimental Design part 2
- Presentation - Diversity Analysis in Qiime
- Practical - Running Alpha and Beta Diversity in QIIME2
- Presentation - Taxonomic assignment
- Practical - Taxonomy assignment:classify-sklearn
- Fungi and ITS best practices!
- Lecture - Multivariate Analysis of Ecological Communities
- Traits of Alpha and Beta Diversity (richness, evenness, dispersion)
- Multivariate Tests for differences in microbial community composition
- Lab - Multivariate Statistics
- Data import & preparation (normalisations, transformations, metadata)
- Multivariate tests for differences in community composition (PERMANOVA, PERMDISP)
- Lecture - Differential Abundance Analysis of Microbial Communities
- Lab: Differential Abundance Analysis with DESeq2
- ➡️ It's Case studies Time!!!
- Practical - Human microbiome data
- Practical - Zoological(Bees bees bees!)
- Practical - Water-mosquito
- Record your Case Study findings
- Some additional Tips 🌋
- Alternative loading methods in Qiime
- What computer to get, and how to install Qiime
- Further, Understanding and Assessing your raw data
- Beer Time 🍻 🍻 🍻