BayraktarLab/cell2location

Reference data: normalized or not

Opened this issue · 1 comments

Hello Cell2location devs,

Small question: does cell2location expect the expression matrix for scRNA reference data to be:

A. Normalized for library size and log-transformed (as is often done in most single-cell RNA analyses)

...or...

B. Unnormalized counts.

If the answer is B, how does cell2location account for discrepencies in library size between cells?

-Dillon

Hi @Dillon214

It is the option B. The estimated average RNA abundance for every gene and every cell type has to be on a linear counts scale for factorisation of the spatial data to work correctly. You can see the details in the paper methods section and in supplementary methods.