qiyuxinlin/exllamav2-KTransformers
A fast inference library for running LLMs locally on modern consumer-class GPUs, supporting DeepSeek and Qwen2 MoE
Python · MIT
Stargazers
No one has starred this repository yet.