SPOTlight: Seeded NMF regression to Deconvolute Spatial Transcriptomics Spots with Single-Cell Transcriptomes
This repository contains all the scripts notebooks and reports to reproduce the analysis of our paper SPOTlight: Seeded NMF regression to Deconvolute Spatial Transcriptomics Spots with Single-Cell Transcriptomes. Here we describe which data we used, how to access it, the most important packages and versions used, and how to navigate the directories and files in this repository.
All the data used for this study was publicly available at the Gene Expression Omnibus or through the pertinent web locations. The data in this project can be divided by tissue as well as by modality.
-
Cortical + Hippocampal scRNAseq: Downloaded from the Allen Brain Map, includes 76,533 cells. Specifications on how the data was generated can be found here.
-
Anterior + Posterior ST: Downloaded from the 10X website, includes 2 Anterior and 2 Posterior slices of adult mice.
- PDAC scRNAseq/ST: PDAC scRNAseq and ST data was obtained from the paper Integrating microarray-based spatial transcriptomics and single-cell RNA-seq reveals tissue architecture in pancreatic ductal adenocarcinomas. Data for PDAC-A can be downloaded through GSM3036909, GSM3036910, GSM3405527, GSM3405528, and PDAC-B GSM3405531, GSM3405532, GSM3405533. ST data used in this paper can be accessed through GSM3036911 and GSM4100723.
- PDAC immune scRNAseq: The immune reference dataset used in this study was obtained from the paper Single-cell RNA-seq Highlights Intra-Tumoral Heterogeneity and Malignant Progression in Pancreatic Ductal Adenocarcinoma . It contains 41,986 cells from which 10,623 are immune cells and can be accessed through the Genome Sequence Archive under project PRJCA001063.
- Benchmarking scRNAseq: This data was obtained from the paper Benchmarking single-cell RNA-sequencing protocols for cell atlas projects. It contains scRNAseq data collected with different protocols and sequencing depths.
These are the versions of the most important packages used throughout all the analysis: CRAN:
Docker environments with all the pertinent dependencies are available at dockerhub.
To download the R environment image
# R environment
docker pull marcelosua/spotlight_env_r:latest
To download the R studio image
# Rstudio environment
docker pull marcelosua/spotlight_env_rstudio:latest
This repository contains 3 main analysis directories, each corresponding to the 3 main analysis of the paper. Scripts within each folder are numbered in the order they are to be run.