ray-lee-94 opened this issue 5 years ago · 0 comments
But if I train the student model directly, the training looks good.