Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.
Primary LanguagePythonMIT LicenseMIT