A Training Bug
zjykzj opened this issue · 2 comments
zjykzj commented
There is a training bug in this project: I only set the teacher model's requires_grad_(False), but still passed its parameters to the optimizer. So the teacher model would be updated during training even though it computes no gradients.
I don't plan to fix it, because the training results show it still works well.
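A minimal sketch of the setup being described, assuming a generic PyTorch student/teacher pair (the module definitions are hypothetical stand-ins):

```python
import torch
import torch.nn as nn

student = nn.Linear(4, 4)
teacher = nn.Linear(4, 4)
teacher.requires_grad_(False)  # teacher is frozen: no gradients are computed for it

# The reported bug: the frozen teacher's parameters are still handed to the optimizer.
params = list(student.parameters()) + list(teacher.parameters())
optimizer = torch.optim.SGD(params, lr=0.1)
```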
zjykzj commented
Maybe it's fine since I set requires_grad=False. See "What is the behavior of passing model parameters with requires_grad == False to an optimizer?"
The optimizer already does this filtering implicitly: SGD, for example, skips any parameter whose .grad is None during step(). To be on the safe side, though, filter the parameters before passing them to the optimizer.
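A minimal sketch of the defensive filtering, under the same hypothetical student/teacher setup as above; it also checks that a frozen parameter's gradient stays None, which is why SGD skips it even without the filter:

```python
import torch
import torch.nn as nn

student = nn.Linear(4, 4)
teacher = nn.Linear(4, 4)
teacher.requires_grad_(False)

# Explicitly drop frozen parameters instead of relying on the
# optimizer's implicit "skip if .grad is None" behavior.
all_params = list(student.parameters()) + list(teacher.parameters())
trainable = [p for p in all_params if p.requires_grad]
optimizer = torch.optim.SGD(trainable, lr=0.1)

# Sanity check: the frozen teacher's .grad stays None after backward(),
# so SGD's step() would leave its weights untouched in any case.
loss = (student(torch.randn(2, 4)) - teacher(torch.randn(2, 4))).pow(2).mean()
loss.backward()
assert teacher.weight.grad is None
```

Filtering this way also makes the intent explicit rather than depending on each optimizer's skip behavior.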
thancaocuong commented
it's not a bug