/MoE-Infinity

PyTorch library for cost-effective, fast and easy serving of MoE models.

Primary language: Python
License: Apache License 2.0 (Apache-2.0)
