The data generator fails to generate correct number of training data
ChenBaiyang opened this issue · 1 comments
Hi Dear,
I found in the code, DataGenerator.generator() can not generate data properly. The parameters:
- at_least_one_labeled does not work.
- la and test_size are not consistent.
Thank you for your assistance.
Bryan
Moreover, when I manually correct the number of training seeds, I got a result that was quite different from the reports in the paper, for example:
-
Parameter settings:
la=0.05, test_size=0.95, at_least_one_labeled=True, model_dict = {'DevNet': DevNet, 'RF': supervised, 'CatB': supervised} -
Results in terms of AUC:
DevNet RF CatB
6_cardio 0.572856 0.7334 0.847772
25_musk 1.0 1.0 0.999278
26_optdigits 0.974521 0.933136 0.914885
36_speech 0.614187 0.493655 0.497312
40_vowels 0.883842 0.805545 0.885713 -
log see attached.
Thank you very much.
Java Printing.pdf
Bryan