Pinned Repositories
InternLM
Official release of InternLM2.5 base and chat models. 1M context support
ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
exllamav2-KTransformers
A fast inference library for running LLMs locally on modern consumer-class GPUs, supporting DeepSeek and Qwen2 MoE
exllamav2.0.0.20
exllamav2 benchmark
gpt-fast-retrival
KV cache retrieval test
numpy-ml
Machine learning, in numpy
pan-light
Baidu Netdisk client without speed limits, golang + qt5, cross-platform GUI
triton
Development repository for the Triton language and compiler
exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
CacheBlend
qiyuxinlin's Repositories
qiyuxinlin/exllamav2-KTransformers
A fast inference library for running LLMs locally on modern consumer-class GPUs, supporting DeepSeek and Qwen2 MoE
qiyuxinlin/exllamav2.0.0.20
exllamav2 benchmark
qiyuxinlin/gpt-fast-retrival
KV cache retrieval test
qiyuxinlin/numpy-ml
Machine learning, in numpy
qiyuxinlin/pan-light
Baidu Netdisk client without speed limits, golang + qt5, cross-platform GUI