wckdouglas/diffexpr

Expected CPU usage and processing time

lauraluebbert opened this issue · 2 comments

Hi Douglas, thanks for diffexpr! I am using it to run DESeq2 on an scRNAseq dataset with 22k genes, 200 replicates, and 2 conditions (100 replicates per condition), as described in your tutorial. This uses a tremendous amount of CPU (all 48 threads on a Linux machine) and takes ~2 hours. Is this expected?

@lauraluebbert thanks for using the package! I have never use DESeq2 for that many replicates, but that sounds running much longer than I'd expect, were there other covariates other than the condition variable? The core of this package is running the R-version of DESeq2 and I think rpy2 (for converting R-object to python-object and vice versa) only add neglectable overhead.. I would encourage you to test the same comparison in an R environment and see whether that improves... Sorry I'm probably not being very helpful here 🥲

No worries, this was helpful! Feel free to close the issue. :)