Pinned Repositories
DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
aphrodite-engine
PygmalionAI's large-scale inference engine
sglang
SGLang is yet another fast serving framework for large language models and vision language models.
aphrodite-engine
Large-scale LLM inference engine
intervitens's Repositories
intervitens/aphrodite-engine
PygmalionAI's large-scale inference engine
intervitens/sglang
SGLang is yet another fast serving framework for large language models and vision language models.