clovaai/overhaul-distillation

train student and set alpha=0, the valid_loss goes up and acc drop a lot

ray-lee-94 opened this issue · 0 comments

image

image

But if i directly train the student model, the training looks good.