/swa

Averaging Weights Leads to Wider Optima and Better Generalization

Primary LanguagePythonMIT LicenseMIT

Stargazers