/Dynamic_MoE

Inference code for the paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models".

Primary Language: Python