Dimensionality reduction is often used to visualize complex expression profiling data. Here, we use the Uniform Manifold Approximation and Projection (UMAP) method on published transcript profiles of 1484 single gene deletions of Saccharomyces cerevisiae. Proximity in low-dimensional UMAP space identifies groups of genes that correspond to protein complexes and pathways, and finds novel protein interactions, even within well-characterized complexes. This approach is more sensitive than previous methods and should be broadly useful as additional transcriptome datasets become available for other organisms.
This is a tutorial to get started working with the Kemmeren et al., yeast deletion strain data in Monocle 3. It includes:
- Dimensionality reduction with UMAP and clustering approaches
- Differential expression testing
- Generating pairwise distances between strains in UMAP space for downstream analysis
To get started, you will need to install Monocle 3
Other helpful R packages are listed at the top of the notebooks.
Michael W. Dorrity, Lauren M. Saunders, Christine Queitsch, Stanley Fields & Cole Trapnell. Dimensionality reduction by UMAP to visualize physical and genetic interactions. Nat Commun 11, 1537 (2020). https://doi.org/10.1038/s41467-020-15351-4
Kemmeren, Patrick, Katrin Sameith, Loes A. L. van de Pasch, Joris J. Benschop, Tineke L. Lenstra, Thanasis Margaritis, Eoghan O’Duibhir, et al. 2014. Large-Scale Genetic Perturbations Reveal Regulatory Networks and an Abundance of Gene-Specific Repressors. Cell 157 (3): 740–52. https://doi.org/10.1016/j.cell.2014.02.054