/dsp_nextflow_training

Hands-on Nextflow training materials and information

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Nextflow fundamentals training

Is your code reusable? Are your results reproducible? Tired of reinventing the wheel? The thing is that reproducibility and even repeatability is challenging even if you are very careful about keeping the same software parameters and versions. Do you want to learn how to write reusable, reproducible, and scalable pipelines?

Nextflow is an advanced workflow management system designed to streamline data-driven processes, especially in the bioinformatics and computational biology fields. It allows users to develop, to execute, and to scale complex workflows across various environments (i.e local machine, Cloud - Azure, AWS, etc... - or HPC). This system is simple and flexible and supports different programming languages (i.e. Python, R, Bash) and containers (e.g Docker, Singularity, etc...).

Let us introduce you to Nextflow and unlock you a door to a vibrant community building and maintaining standardized reproducible bioinformatics analyses, trainings, hackathons and resources to make your data-driven analysis scalable an reproducible.

Objectives

In this course you will learn:

  • What is Nextflow?

  • Why is needed?

  • Nextflow concepts (e.g. channels, processes and operators, parallellism, reentrancy, reusability)

  • Core features (e.g Portability, Scalability, Reproducibility, Modularity)

  • Installing Nextflow (requirements, you will use a prepared dev environment)

  • Write and run your first Nextflow script (nextflow script, config file, results, working directory)

  • Run a nextflow pipeline in a Docker image (run for example a short version of an rnaseq pipeline)

  • Seqera platform (monitoring your pipeline execution)

  • nf-core community (113 bioinformatics standardized workflows)

  • Resources to keep training