/AdamP

Slowing Down the Weight Norm Increase in Momentum-based Optimizers

Primary language: Python
License: MIT
