exllamav2-KTransformers

A fast inference library for running LLMs locally on modern consumer-class GPUs, with support for DeepSeek and Qwen2 MoE models.

Primary language: Python · License: MIT
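
Below is a minimal generation sketch using the upstream ExLlamaV2 Python API (config, cache, tokenizer, dynamic generator). The model path is a placeholder, and any KTransformers-specific MoE offloading options this fork adds are not shown; consult the fork's examples for those.

```python
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2DynamicGenerator

# Placeholder path to an EXL2-quantized model directory
model_dir = "/path/to/model-exl2"

# Load model config and weights, splitting layers across available GPUs
config = ExLlamaV2Config(model_dir)
model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, max_seq_len=32768, lazy=True)
model.load_autosplit(cache, progress=True)

tokenizer = ExLlamaV2Tokenizer(config)

# Dynamic generator handles batching and paged attention internally
generator = ExLlamaV2DynamicGenerator(
    model=model,
    cache=cache,
    tokenizer=tokenizer,
)

output = generator.generate(
    prompt="Once upon a time,",
    max_new_tokens=200,
    add_bos=True,
)
print(output)
```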
