Large Batch Training of Convolutional Networks with Layer-wise Adaptive Rate Scaling
Primary language: Python
License: MIT