Tony-Y/pytorch_warmup

Why did my learning rate drop from the initial lr

sjchasel opened this issue · 3 comments

In every batch, I execute:

loss.backward()
optimizer.zero_grad()
optimizer.step()
with warmup_scheduler.dampening():
    lr_scheduler.step()

There is no warm-up phase; the learning rate just drops from the initial lr right away.

Your code does not optimize the model parameters at all because optimizer.zero_grad() is called after loss.backward(): the gradients are zeroed out before optimizer.step() can use them. If this is unclear, please read the following tutorial:

https://pytorch.org/tutorials/beginner/basics/optimization_tutorial.html#optimizer
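
For reference, here is a minimal sketch of the corrected ordering: zero the gradients before the backward pass, step the optimizer, and only then step the LR scheduler inside the warmup dampening context. The model, loss, and scheduler choices below are only illustrative, not taken from the original post.

import torch
import pytorch_warmup as warmup

# Illustrative setup (replace with your own model, optimizer, and schedulers).
model = torch.nn.Linear(10, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
lr_scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.999)
warmup_scheduler = warmup.UntunedLinearWarmup(optimizer)
loss_fn = torch.nn.MSELoss()

for step in range(100):
    x, y = torch.randn(32, 10), torch.randn(32, 1)  # stand-in for a real batch
    optimizer.zero_grad()                 # clear stale gradients before backward
    loss = loss_fn(model(x), y)
    loss.backward()                       # compute fresh gradients
    optimizer.step()                      # update parameters using those gradients
    with warmup_scheduler.dampening():    # dampen the LR during the warm-up period
        lr_scheduler.step()               # then advance the LR schedule

With this order, optimizer.step() sees the gradients from loss.backward(), and the learning rate ramps up during warm-up instead of being dropped immediately by the scheduler.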

Did you solve this issue?