alprnbg opened this issue 4 years ago · 0 comments
First of all, thank you for the work.
I wonder why you did not use optimizer.zero_grad() in the torch code ? Is it a technique to get better result or is it just a mistake?