/pytorch-lars

Layer-wise Adaptive Rate Scaling in PyTorch

Primary LanguagePythonMIT LicenseMIT

Watchers