/Min-MoE

A simple implementation of Mixture of Experts in TensorFlow.

Primary Language: Jupyter Notebook
