pinellolab/scATAC-benchmarking

Center the TF-IDF before irlba?

fransilvion opened this issue · 1 comments

Hi,

Maybe, I missed this but I am just confused. Why don't you center the TF-IDF matrix before irlba function? This function doesn't do centering by default, and as far as I know, you need to center the data before extracting principal components.

Hi ,

That's a fair point. To be honest, I haven't tested it out myself so i'm not sure how standardization will affect the final result.

But here we are trying to benchmark different scATAC methods. We would like to follow the tutorial of the original method as much as possible.

The tutorial for Cusanovich2018 can be found http://atlas.gs.washington.edu/fly-atac/docs/#usecase1 . There they didn't standardize the data before PCA either.

But again, it would be nice to explore the difference of with or without standardization as well. I guess it's just a bit beyond the scope of our work.