CamaraLab/STvEA

Normalization of CODEX data

Closed this issue · 1 comments

Hello. I don't understand how you normalize the CODEX data in your article.
You mentioned the following:
We normalized the processed CODEX data by the total levels in each cell
where Mhi is the level of antigen h in cell i before normalization. After this process, antigen levels are well approximated by a two- component Gaussian mixture model where the Gaussian with the highest median corresponds to the sig- nal component, and the mixing parameter ah represents the proba- bility of a measurement of antigen h actually coming from the background. Upon fitting the model to the data using the expectation- maximization algorithm for maximum likelihood estimation, we filtered out the background component of the data by considering
the probabilities"

Could you please tell me where is the code to perform this normalization.
Thanks a lot
Leandro Balzano

The CleanCODEX() function performs both parts of this normalization: first, it divides each CODEX value by the total expression per cell, then fits a two component Gaussian mixture model and takes the new normalized expression as the value of the CDF of the higher Gaussian component at each expression value.