Triton-based implementation of Sparse Mixture of Experts.
Primary language: Python
License: Apache License 2.0 (Apache-2.0)