/Hackathon_NGS_2022

Projet Master2 AMI2B test de reproductibilité

Primary LanguageHTMLMIT LicenseMIT

Project Repro-Hackathon

Project of the Master of Bioinformatics (AMI2B) of the University Paris-Saclay realized by :

Presentation

The goal is to reproduce parts of the analysis described in these papers (to read):

They performed RNA-Seq in samples from patients with uveal melanoma. Some samples are mutated in SF3B1 . We want to analyze this data in order to find differentially expressed genes, i.e. genes that are more (or less) expressed in one condition (SF3B1 mutated samples) compared to another (SF3B1 non mutated samples).

Organization

To do this, we have designed and implemented a reproductible workflow.

  • A Directory Tools to build all containers (using Singularity) with the tools that will be used in the workflow.
  • A Directory Workflow with the rules and files (using Snakemake) needed for the workflow.
  • A Script run.sh to execute the workflow.
  • A README file

Images Resumes

alt text

Rules Resumes :

alt text

About IFB Cloud

"French Institute of Bioinformatics (IFB) provides life scientists with a federation of clouds, Biosphere, and bioinformatics cloud services to analyze life science data. Biosphere is used for scientific production in the life sciences, developments, and to support events like cloud and scientific training sessions, hackathons or workshops."

Setting up a VM

IFB Cloud Biosphere

For the moment the pipeline work for a VM BioPipes "ifb..mxlarge (16 vCPU, 64Go GB RAM, 400Go local Disk)"

It should also work for a VM of 8 CPUs, below 8 the indexing of the whole genome is impossible.

Run the workflow

./run.sh