/recountmethylation_bioc2021

Learning materials, including presentation slides and code, cheat sheet, and wiki, for the recountmethylation Bioc2021 software demo.

Primary LanguageHTMLMIT LicenseMIT

recountmethylation_bioc2021

Welcome to the repo for the Bioc2021 software demo/workshop on recountmethylation! Here, you can find key learning resources, including the presentation slides and code, a recountmethylation cheatsheet, and a helpful wiki that links to tutorials, Illumina's platform documentation, and key related Bioconductor libraries.

recountmethylation hexsticker

Learning resources provided in this repo

  • Workshop recording -- Pre-recorded lecture for the recountmethylation software demo/workshop at Bioc2021
  • Presentation slides -- Slides from the Bioc2021 workshop presentation.
  • Presentation code -- Scripts to reproduce the workshop presentation code examples. You can follow along with these scripts during the code break sections of the presentation.
  • Wiki -- Links to platform documentation, Bioconductor dependencies, and tutorials with crucial background.
  • Cheatsheet -- Overview of terms, code, and a decision tree to determine which database file type to download.

Details about the invited software demo/workshop

The demo will happen at 7pm PT, August 4th, 2021. You may follow along by referring to the provided slides and scripts. This page will also link to the pre-recorded lecture video, once it's uploaded to YouTube.

Remote session info

Code break scripts and questions covered

The code break sections of this demo cover fundamental topics to aid an understanding of the file types of the recountmethylation database compilation files.

Code Break #1 questions (script)

  • What is the SummarizedExperiment container/class?
  • What are the classes of SummarizedExperiment objects used to analyze DNAm array data?
  • How do I inspect properties of SummarizedExperiment objects?
  • How do I convert RGChannelSet objects between DNAm array platforms (e.g. from EPIC to HM450K)?
  • How do I combine RGChannelSet objects between different platforms?

Code Break #2 questions (script)

  • How do I download database files from the server (/recount.bio/data) using the getdb() functions?
  • How do I convert from SummarizedExperiment to h5se/HDF5-SummarizedExperiment?
  • How do I save and load an h5se dataset?
  • What are the contents of an h5se directory, and what error occurs when this fails to load?
  • How do I rapidly update the sample metadata, or colData, for an h5se dataset, without overwriting all the assays data?
  • When do I need to recast an h5se object as a basic, non-DelayedArray backed, SummarizedExperiment?

Code Break #3 questions (script)

  • How do I summarize the learned labels for the tissue and disease variables in the sample metadata?
  • How do I query the tissue variable for a specific label, using regular expressions?
  • How do I summarize available samples by study, for a given subset of filtered samples?
  • How would I access GEO in a browser, concurrent with an R session, to better understand my filtered samples?
  • What information should I include as I'm annotating studies represented among filtered samples?

Manuscript citation

If you use recountmethylation in published work, please cite the following paper:

Sean K Maden, Reid F Thompson, Kasper D Hansen, Abhinav Nellore, Human methylome variation across Infinium 450K data on the Gene Expression Omnibus, NAR Genomics and Bioinformatics, Volume 3, Issue 2, June 2021, lqab025, https://doi.org/10.1093/nargab/lqab025