/scCATCH

Automatic Annotation on Cell Types of Clusters from Single-Cell RNA Sequencing Data

Primary LanguageRGNU General Public License v3.0GPL-3.0

scCATCH v3.2.1

R >4.0 installed with CRAN download CellMatch

Automatic Annotation on Cell Types of Clusters from Single-Cell RNA Sequencing Data

Recent advance in single-cell RNA sequencing (scRNA-seq) has enabled large-scale transcriptional characterization of thousands of cells in multiple complex tissues, in which accurate cell type identification becomes the prerequisite and vital step for scRNA-seq studies. Currently, the common practice in cell type annotation is to map the highly expressed marker genes with known cell markers manually based on the identified clusters, which requires the priori knowledge and tends to be subjective on the choice of which marker genes to use. Besides, such manual annotation is usually time-consuming.

To address these problems, we introduce a single cell Cluster-based Annotation Toolkit for Cellular Heterogeneity (scCATCH) from cluster marker genes identification to cluster annotation based on evidence-based score by matching the identified potential marker genes with known cell markers in tissue-specific cell taxonomy reference database (CellMatch).

CellMatch includes a panel of 353 cell types and related 686 subtypes associated with 184 tissue types, and 2,096 references of human and mouse.

Install

# install from cran

install.packages("scCATCH")

OR

# install devtools and install

install.packages(pkgs = 'devtools')
devtools::install_github('ZJUFanLab/scCATCH')

Usage

The scCATCH mainly includes two function findmarkergene() and findcelltype() to realize the automatic annotation for each identified cluster as detailed below:

# sc_data is the scRNA-seq data matrix 
# sc_cluster is a character containing the cluster information

obj <- createscCATCH(data = sc_data, cluster = sc_cluster)

# find marker gene for each cluster
obj <- findmarkergene(obj, species, marker, tissue, cancer)

# find cell type for each cluster
obj <- findcelltype(obj)

For more detailed information, please refer to the document and tutorial vignette. Available tissues and cancers see the wiki page

obj <- createscCATCH(data = Seurat_obj[['RNA']]@data, cluster = as.character(Idents(Seurat_obj)))

Cite

Please cite us as Shao et al., scCATCH:Automatic Annotation on Cell Types of Clusters from Single-Cell RNA Sequencing Data, iScience, Volume 23, Issue 3, 27 March 2020. doi: 10.1016/j.isci.2020.100882. PMID:32062421