query regarding the target
meghanakaranam1 opened this issue · 0 comments
meghanakaranam1 commented
Hello,
As mentioned in issue #11, I've trained the model on 1.5M smiles. But as specified in the paper, "The dopamine type 2 receptor DRD2 was chosen as the target, and corresponding bioactivity data was extracted from ExCAPE-DB [33]. In this dataset there are 7218 actives (pIC50 > 5) and 343204 inactives (pIC50 < 5)." I have a few questions to ask on this. @MarcusOlivecrona
- is that 1.5M dataset related to DRD2 only?
- if yes then how do I filter 1.5M down to ( 343204+7218)
please help me understand this.
looking forward to hearing from you.