
pytorch-gradual-warmup-lr

Gradually warms up (increases) the learning rate for a PyTorch optimizer, as proposed in 'Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour'.

[Example TensorBoard plot: gradual warmup for 100 epochs, followed by cosine annealing.]
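
During warmup, the learning rate rises linearly from the base LR to base LR × multiplier over total_epoch epochs. A minimal sketch of that linear rule (illustrative only; warmup_lr is a hypothetical helper, and the formula assumes multiplier > 1, following the upstream implementation):

def warmup_lr(base_lr, multiplier, epoch, total_epoch):
    # Linear interpolation: base_lr at epoch 0, base_lr * multiplier at epoch == total_epoch.
    return base_lr * ((multiplier - 1.0) * epoch / total_epoch + 1.0)

for epoch in range(11):
    print(epoch, warmup_lr(0.1, 8, epoch, 10))
# 0 -> 0.1 ... 10 -> 0.8; after warmup, the after_scheduler takes over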

Install

$ pip install git+https://github.com/saareliad/pytorch-gradual-warmup-lr.git
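
To confirm the package installed correctly, the import can be checked from the command line (a quick sanity check, not an official step):

$ python -c "from warmup_scheduler import GradualWarmupScheduler; print('ok')"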

Usage

import torch
from warmup_scheduler import GradualWarmupScheduler

# optimizer is any torch.optim.Optimizer; max_epoch and train_epoch are your own constants
scheduler_cosine = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=max_epoch)
scheduler_warmup = GradualWarmupScheduler(optimizer, multiplier=8, total_epoch=10, after_scheduler=scheduler_cosine)

for epoch in range(train_epoch):
    # train(...)
    # validate(...)
    scheduler_warmup.step()     # 10-epoch warmup, then steps scheduler_cosine
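
To inspect the resulting schedule without a full training setup, the scheduler can be driven with a dummy optimizer. A self-contained sketch (the SGD settings, T_max=90, and the 100-epoch loop are illustrative assumptions, not values from this repo):

import torch
from warmup_scheduler import GradualWarmupScheduler

params = [torch.nn.Parameter(torch.randn(2, 2))]
optimizer = torch.optim.SGD(params, lr=0.1)

scheduler_cosine = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=90)
scheduler_warmup = GradualWarmupScheduler(optimizer, multiplier=8, total_epoch=10, after_scheduler=scheduler_cosine)

for epoch in range(100):
    optimizer.step()            # PyTorch >= 1.1: step the optimizer before the scheduler
    scheduler_warmup.step()
    print(epoch, optimizer.param_groups[0]['lr'])   # 0.1 -> 0.8 over 10 epochs, then cosine decay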

Changes from original repo

This is my own fork of https://github.com/ildoonet/pytorch-gradual-warmup-lr. My changes add compatibility with PyTorch >= 1.1; see the discussion in the original repository. The fork is kept separate to maintain stability across existing projects.
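
Concretely, the PyTorch >= 1.1 scheduler API expects optimizer.step() to be called before scheduler.step() each epoch; calling the scheduler first triggers a UserWarning and shifts the schedule by one step. A hedged sketch of the loop shape this implies (train_one_epoch is a hypothetical helper that runs the forward/backward passes and calls optimizer.step()):

for epoch in range(train_epoch):
    train_one_epoch(model, optimizer)   # forward/backward + optimizer.step() inside
    scheduler_warmup.step()             # scheduler stepped only after the optimizer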