GATK Variant Quality Score Recalibration for filtering a variant call set

Snakemake Workflow for Variant Quality Score Recalibration (VQSR)

This is an implementation of GATK Variant Quality Score Recalibration (VQSR) as a Snakemake pipeline, written by Sherine Awad.

You will need your cohort VCF file; you can change its name and path in the config file.

Run the pipeline

snakemake -jn 

where n is the number of cores. For example, for 10 cores use:

snakemake -j10 

Use conda

To avoid tool version mismatches, use conda:

snakemake -jn --use-conda 

For example, for 10 cores use:

snakemake -j10 --use-conda 

This automatically pulls the same versions of the tools we used. Conda must be installed on the system, in addition to Snakemake.

Dry Run

For a dry run use:

snakemake -j1 -n 

To also print the commands during a dry run, use:

snakemake -j1 -n -p 

Use the corresponding config file

You can keep a separate config file for each cohort and pass the appropriate one as follows:

snakemake -j1 --configfile config-WES.yaml 

or:

snakemake -j1 --configfile config-WGS.yaml
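If you have several cohorts, the per-cohort invocations can be scripted. The following is a minimal sketch, not part of this repository: the helper names `build_command` and `run_cohorts` are hypothetical, and it assumes that `snakemake` is on your PATH and that the cohorts can safely be run one after another from the same working directory. It uses only the flags shown in this README (`-j`, `--configfile`, `--use-conda`, `-n`, `-p`).

```python
import shlex
import subprocess


def build_command(configfile, cores=1, use_conda=False, dry_run=False):
    """Return the snakemake argument list for one cohort config file.

    Hypothetical helper: it only assembles the flags documented in this README.
    """
    cmd = ["snakemake", f"-j{cores}", "--configfile", configfile]
    if use_conda:
        cmd.append("--use-conda")
    if dry_run:
        # -n performs a dry run; -p also prints the shell commands.
        cmd += ["-n", "-p"]
    return cmd


def run_cohorts(configfiles, **options):
    """Run the pipeline once per config file, stopping at the first failure."""
    for configfile in configfiles:
        cmd = build_command(configfile, **options)
        print("Running:", shlex.join(cmd))
        # check=True raises CalledProcessError if snakemake exits non-zero.
        subprocess.run(cmd, check=True)
```

For example, `run_cohorts(["config-WES.yaml", "config-WGS.yaml"], cores=10, dry_run=True)` would perform a dry run for each cohort in turn; drop `dry_run=True` once the printed commands look right.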

References

  1. Brouard, Jean-Simon, Flavio Schenkel, Andrew Marete, and Nathalie Bissonnette. "The GATK joint genotyping workflow is appropriate for calling variants in RNA-seq experiments." Journal of animal science and biotechnology 10, no. 1 (2019): 1-6.

  2. Van der Auwera, Geraldine A., Mauricio O. Carneiro, Christopher Hartl, Ryan Poplin, Guillermo Del Angel, Ami Levy‐Moonshine, Tadeusz Jordan et al. "From FastQ data to high‐confidence variant calls: the genome analysis toolkit best practices pipeline." Current protocols in bioinformatics 43, no. 1 (2013): 11-10.

  3. Poplin, R., Ruano-Rubio, V., DePristo, M. A., Fennell, T. J., Carneiro, M. O., Van der Auwera, G. A., ... & Banks, E. (2018). Scaling accurate genetic variant discovery to tens of thousands of samples. BioRxiv, 201178.

  4. https://gatk.broadinstitute.org/hc/en-us/articles/360035531112--How-to-Filter-variants-either-with-VQSR-or-by-hard-filtering

  5. https://gatk.broadinstitute.org/hc/en-us/articles/360035531612