/mdd-rmUKBB

MDD meta-analysis removing individual overlap with UK Biobank

Primary LanguageHTMLMIT LicenseMIT

mdd-rmUKBB

Psychiatric Genomics Consortium (PGC) Major Depressive Disorder (MDD) genome-wide association study meta-analysis removing individual overlap with UK Biobank.

Project overview

Many uses of genome-wide summary statistics require that there be no sample overlap between the discovery and testing datasets. UK Biobank is an open health data set which has been included in previous PGC Major Depressive Disorder GWAS (Wray et al 2018, Howard et al 2019). Because UK Biobank (UKBB) is used by many researchers, we have conducted and released GWAS summary stastics where overlap with UKBB has been excluded.

Datasets used are individual level data from the MDD Wave2 cohorts and summary statistics from the additional MDD cohorts (deCODE, GenScot, GERA, iPsych, 23andMe).

Data for this project are held on LISA in the directories listed in the README.mddw2sum and README.mdd00001 files in your LISA home directory. Preimputation QC and imputation was performed previously using the RICOPILI modules.

This project is viewable at https://psychiatric-genomics-consortium.github.io/mdd-rmUKBB/.

Setup and render notebook

To get an interactive session on LISA and load required programs:


#Go into interactive mode
srun -n 16 -t 1:00:00 --pty bash -il

#Load R
module load pre2019
module load R/3.4.3

# download pandoc
curl -L -O https://github.com/jgm/pandoc/releases/download/2.9.1.1/pandoc-2.9.1.1-linux-amd64.tar.gz
tar xzf pandoc-2.9.1.1-linux-amd64.tar.gz


Generate main GWAS notebook


export PATH=$PATH:pandoc-2.9.1.1/bin
Rscript -e "rmarkdown::render('gwas.Rmd')"

Data Availability

Meta-analyzed summary statistics excluding 23andMe will be available for download from the PGC as "PGC MDD No UKB / No 23andMe". Results including 23andMe will be available by contacting the PGC Data Access Committee

Requirements

  • Ricopili - Rapid Imputation and COmputational PIpeLIne for conducting GWAS and meta-analysis
  • R project - For preparing phenotype files
  • R packages: readr, dplyr, stringr, tidyr
  • Individual-level data access: Data is held securely on the LISA server

Analysts

Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium

  • Mark James Adams - analyst - Edinburgh
  • Swapnil Awasthi - analyst - Broad
  • David Howard - analyst - KCL
  • Naomi Wray - analytical group director - Queensland
  • Stephan Ripke - analytical group director - Broad
  • Andrew McIntosh - workgroup chair - Edinburgh
  • Cathryn Lewis - workgroup chair - KCL

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

Contact

References

  • Wray, N.R., Ripke, S., Mattheisen, M. et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat Genet 50, 668–681 (2018). DOI:10.1038/s41588-018-0090-3
  • Howard, D.M., Adams, M.J., Clarke, T. et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci 22, 343–352 (2019). DOI:10.1038/s41593-018-0326-7