nanoMOE

An implementation of sparse Mixture of Experts (MoE) on Karpathy's nanoGPT.

Primary language: Python. License: MIT.
