Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Primary language: Python
License: Apache License 2.0 (Apache-2.0)
