patch loss
Why does your patch loss use the features of the teacher network? And how do you train self.classifier in dacs.py?
@super233 Actually, the teacher model has no gradient flow.
Lines 136 to 138 in 4eb532e
The patch loss may not actually be applied in this code.
Thanks for your reply. Then why does your patch loss use the features of the teacher network?
I can't say, because I am not the author. I am also waiting for the author's reply.
Hi @super233, in our code we use the teacher branch to compute the patch loss for the ablation study. As [zyuanbing] says, there is no gradient flow in the teacher branch; you could generate the features from the student branch instead. We will provide the updated code soon.
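For anyone following along, here is a minimal PyTorch sketch of what "no gradient flow in the teacher branch" implies. It is not the repository's code: `student`, `teacher`, `classifier`, and `patch_loss` are hypothetical stand-ins, and the sketch assumes the patch classifier is a small head applied on top of pooled features. Under that assumption, a loss computed on teacher features can still update the classifier head, but nothing reaches either encoder; switching to student features lets the gradient reach the student encoder as well.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical stand-ins, NOT the repository's actual modules.
student = nn.Conv2d(3, 8, kernel_size=3, padding=1)
teacher = nn.Conv2d(3, 8, kernel_size=3, padding=1)
classifier = nn.Conv2d(8, 4, kernel_size=1)  # assumed patch classifier head


def patch_loss(feat, labels, patch=4):
    # Average features over non-overlapping patches, then classify each patch.
    pooled = F.avg_pool2d(feat, patch)
    logits = classifier(pooled)
    return F.cross_entropy(logits, labels)


def has_grad(module):
    # True if backward() populated a gradient for any parameter of the module.
    return any(p.grad is not None for p in module.parameters())


def reset_grads():
    for m in (student, teacher, classifier):
        m.zero_grad(set_to_none=True)


x = torch.randn(2, 3, 16, 16)
labels = torch.randint(0, 4, (2, 4, 4))  # one label per 4x4 patch

# Case 1: features from the teacher branch, computed without a graph.
with torch.no_grad():
    t_feat = teacher(x)
patch_loss(t_feat, labels).backward()
teacher_case = (has_grad(teacher), has_grad(student), has_grad(classifier))

# Case 2: features from the student branch.
reset_grads()
patch_loss(student(x), labels).backward()
student_case = (has_grad(teacher), has_grad(student), has_grad(classifier))

print("teacher features ->", teacher_case)  # (False, False, True)
print("student features ->", student_case)  # (False, True, True)
```

Each tuple reads as (teacher has grad, student has grad, classifier has grad). Whether the actual self.classifier in dacs.py is trained this way is something only the authors' updated code can confirm.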
Have you found out how the author trains self.classifier in dacs.py?
Have you figured this out yet?
Also, I couldn't find how the author trains self.cls_head in encoder_decoder.py either. Have you found it?