/SpatialDecon

The SpatialDecon library implements the SpatialDecon algorithm for mixed cell deconvolution in spatial gene expression datasets. (This algorithm also works in bulk expression profiling data.)

Primary LanguageRMIT LicenseMIT

Overview

The SpatialDecon library implements the SpatialDecon algorithm for mixed cell deconvolution in spatial gene expression datasets. (This algorithm also works in bulk expression profiling data.)

Details can be found in the SpatialDecon manuscript: Danaher & Kim (2022), "Advances in mixed cell deconvolution enable quantification of cell types in spatial transcriptomic data." https://www.nature.com/articles/s41467-022-28020-5

Installation

Download the release version from Bioconductor

https://bioconductor.org/packages/release/bioc/html/SpatialDecon.html

Install the release version from Bioconductor

if (!requireNamespace("BiocManager", quietly=TRUE))
    install.packages("BiocManager")

BiocManager::install(version="release")

BiocManager::install("SpatialDecon")

Documentation

To learn how to start using SpatialDecon, view documentation for the version of this package installed in your system, start R and enter:

browseVignettes("SpatialDecon")

Guide to functions

Data preparation functions:

  • "download_profile_matrix" Downloads any one of ~75 cell profile matrices compiled for deconvolution of diverse tissue types.
  • "create_profile_matrix" Creates custom profile matrix from single cell data: count matrix and cell type annotations.
  • "derive_GeoMx_background_at_normalized_scale" Estimates the background expected from each data point in a GeoMx dataset. Accurate background estimation is key for SpatialDecon's accuracy.

SpatialDecon: the core deconvolution function:

spatialdecon runs the SpatialDecon algorithm for estimating mixed cell type abundance in the regions of spatially-resolved gene expression studies. It is also appropriate to apply to bulk gene expression data.

Its minimal required input is:

  • A normalized data matrix
  • A matrix of expected background counts at each element of the normalized data matrix
  • A matrix of expected cell type expression profiles

spatialdecon has numerous advanced options specified in the help file and demonstrated in the vignette. These include:

  • Merge closely-related cell types
  • Estimate cell abundance on the scale of absolute cell counts
  • Use regions of pure tumor cells (or any other cell type missing from the cell profile matrix) to infer a missing cell type's profile and estimate it alongside the cells with known profiles.

Plotting functions:

  • "TIL_barplot" is a convenient way to draw barplots of cell type abundance/ proportion
  • "florets" is for plotting cells in space. For each data point specified in xy space, it draws a circular barplot showing the localized abundance of cell types.

Post-deconvolution analyses:

  • "reverseDecon" Models genes ~ decon-derived cell scores. It produces fitted values, residuals, and various metrics of how much genes depend on cell mixing. In studies where cell mixing is a dominant source of variance, these residuals aid interpretation.

Other

Computational benchmarking

The below memory usage and runtimes were gathered from applying spatialdecon to increasing numbers of GeoMx AOIs, using a 544-gene x 18 cell-type cell profile matrix:

n memory (MB) runtime (ms)
10 324 500
50 1598 2190
100 3130 4280
200 6266 8610
500 15244 21610
1000 30451 42580
5000 61522 83720