NGS_DNA pipeline

Manual

The manual for installation and use is available at https://molgenis.gitbooks.io/molgenis-pipelines/

Summary

Reads produced by the sequencer (in FastQ format) are aligned to the hg19 reference genome with BWA (Li & Durbin¹). The aligned reads are processed with Sambamba (Tarasov et al.²). GATK (McKenna et al.³) is then applied for duplicate removal, followed by SNP and INDEL discovery and genotyping, using standard hard filtering parameters according to the GATK Best Practices recommendations (Van der Auwera et al.⁴).
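As a rough illustration of what "hard filtering" means in the last step, the sketch below checks a SNP's annotations against fixed thresholds. The thresholds are the generic SNP values commonly cited in GATK's hard-filtering guidance and are an assumption here; the values this pipeline actually uses may differ. The names `SNP_HARD_FILTERS` and `failed_snp_filters` are hypothetical and are not part of the pipeline.

```python
# Assumed thresholds: generic GATK hard-filtering values for SNPs.
# These are illustrative; the NGS_DNA pipeline's own settings may differ.
SNP_HARD_FILTERS = {
    "QD": lambda v: v < 2.0,               # low quality by depth
    "FS": lambda v: v > 60.0,              # strong strand bias (Fisher test)
    "MQ": lambda v: v < 40.0,              # low RMS mapping quality
    "MQRankSum": lambda v: v < -12.5,      # alt reads map worse than ref reads
    "ReadPosRankSum": lambda v: v < -8.0,  # alt alleles biased to read ends
}


def failed_snp_filters(annotations):
    """Return the names of the annotations that fail their threshold.

    `annotations` maps INFO field names to numeric values. Annotations
    that are absent from the mapping are skipped rather than failed.
    """
    return [
        name
        for name, fails in SNP_HARD_FILTERS.items()
        if name in annotations and fails(annotations[name])
    ]


if __name__ == "__main__":
    # A variant with low quality-by-depth fails only the QD check.
    print(failed_snp_filters({"QD": 1.5, "FS": 10.0, "MQ": 60.0}))
```

A variant that returns an empty list passes all checks it has annotations for. Unlike a learned model such as VQSR, each annotation is tested independently against a fixed cutoff.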

References

1. Li & Durbin, Fast and accurate short read alignment with Burrows-Wheeler transform.
2. Tarasov et al., Sambamba: Fast processing of NGS alignment formats.
3. McKenna et al., The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.
4. Van der Auwera et al., From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

