Curated Data Sets
- tidypvals - data from over 2.5 million published p-values compiled by Jeff Leek.
- recount2 - large collection of RNA-seq data created by Leo Collado Torres, Ben Langmead, and Abhi Nellore.
- Recount a database of free RNA-seq data sets at the gene count level created by Alyssa Frazee
- Medical Journal P-value Data - a set of p-values scraped from the abstracts of the New England Journal of Medicine, JAMA, BMJ, The Lancet, and the Amerian Journal of Epidemiology from 2000-2010 created by Jeff Leek.
- Processed GEUVADIS data - processed data from 667 RNA-seq samples from the GEUVADIS project created by Alyssa Frazee.
- GEUVADIS BAM files - processed with Tophat2 and created by Alyssa Frazee
- anitProfiles data - gene expression data from cancer and normal samples used in our paper on classifying with variance created by Hector Corrada Bravo.
- Peer review data - data from our laboratory experiment of peer review created by Jeff Leek and Margaret Taub.
- Batch effect data - data on batch effects from a range of genomic technologies created by Jeff Leek, Rafa Irizarry,Hector Corrada Bravo, Ben Langmead, Keith Baggerly, Evan Johnson, Rob Scharpf, and David Simcha.
- Bladder batch data - data on batch effects from a study of bladder cancer in expression set format created by Jeff Leek
- Braincloud - data from our paper looking at patterns of variation in gene expression in the human brain created by Carlo Colantuoni.