Pinned Repositories
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization, with a 2x speedup during inference.
qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
mlc-llm
Universal LLM Deployment Engine with ML Compilation
filesystem_spec
A specification that python filesystems should adhere to.
xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
mnn-llm
An LLM deployment project based on MNN.
MuYu-zhi's Repositories
MuYu-zhi/filesystem_spec
A specification that python filesystems should adhere to.