/GeneSpectra

GENE classification and cross-SPECies comparison of cell Types with single cell RNA-seq data

Primary LanguagePython

Gene classification based on cell type expression specificity and distribution

The GeneSpectra module performs gene classification using scRNA-seq data.

Read our preprint here: Context-aware comparison of cell type gene expression across species

Steps:

  1. Reduce sparsity by creating metacells or pooling
  2. Normalize data and filter low count genes
  3. Multi-thread gene classification for gene specificity and distribution
  4. Compare orthologs classes between species

Note that the gene classes are modified based on Human Protein Atlas classifications by Karlsson, M. et al.

Modules

Metacells

Wrapper functions and helper functions to use metacells to create metacells based on scRNA-seq data. It is also recommended to follow the official metacells workflow to create the most tailored metacells anndata object (use the iterative vignette for brand-new data), as you have more freedom to adjust various parameters. Alternatively, when the dataset is unsuitable for metacell calculation, merge cells of the same annotation label to create cell pools.

Gene classification

Core module to perform gene filtering, normalization, and gene specificity and distribution classification. Uses multi-processing to parallize the processing of genes.

Cross species

Cross species comparison of gene classes, and plotting. Using ensembl or eggNOGV6 homology.

Developer / maintainer: Yuyao Song, ysong@ebi.ac.uk