SIDCo is An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems
Primary LanguagePythonMIT LicenseMIT