/data

Data resources created by the Leek group

Curated Data Sets

  • tidypvals - data from over 2.5 million published p-values compiled by Jeff Leek.
  • recount2 - large collection of RNA-seq data created by Leo Collado Torres, Ben Langmead, and Abhi Nellore.
  • Recount a database of free RNA-seq data sets at the gene count level created by Alyssa Frazee
  • Medical Journal P-value Data - a set of p-values scraped from the abstracts of the New England Journal of Medicine, JAMA, BMJ, The Lancet, and the Amerian Journal of Epidemiology from 2000-2010 created by Jeff Leek.
  • Processed GEUVADIS data - processed data from 667 RNA-seq samples from the GEUVADIS project created by Alyssa Frazee.
  • GEUVADIS BAM files - processed with Tophat2 and created by Alyssa Frazee
  • anitProfiles data - gene expression data from cancer and normal samples used in our paper on classifying with variance created by Hector Corrada Bravo.
  • Peer review data - data from our laboratory experiment of peer review created by Jeff Leek and Margaret Taub.
  • Batch effect data - data on batch effects from a range of genomic technologies created by Jeff Leek, Rafa Irizarry,Hector Corrada Bravo, Ben Langmead, Keith Baggerly, Evan Johnson, Rob Scharpf, and David Simcha.
  • Bladder batch data - data on batch effects from a study of bladder cancer in expression set format created by Jeff Leek
  • Braincloud - data from our paper looking at patterns of variation in gene expression in the human brain created by Carlo Colantuoni.