Implements https://arxiv.org/abs/1711.05101 AdamW optimizer and cosine learning rate scheduler for PyTorch framework
Primary LanguagePythonMIT LicenseMIT