theislab/scib

Questions about the weighted score design of scIB

HelloWorldLTY opened this issue · 1 comments

Hi, I have a quick question about the setting of the weighted sum:
image

I understand to assgin S_bio with 0.6 and S_batch as 0.4 are to ensure bio convservation is more important. However, I wonder what is the motivation for choosing such weight combination. Shall we choose S_bio has weight as 0.7, for example, for some datasets or some tasks? Do we need to perform grid search for this weight in our practical applications? Thanks.

Hi, this is a very good question. We somewhat arbitrarily chose the weighting to ensure that bio metrics have more importance than the batch metrics, however you might want to try out different weightings , depending on the number of metrics you use and the biological question at hand. @LuckyMD what are your opinions on this?