/InfMoE

Inference framework for MoE layers based on TensorRT with Python binding

Primary LanguageC++MIT LicenseMIT

Watchers