/towards_moe

Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022

Primary Language: Jupyter Notebook

License: MIT
