Pinned Repositories
Binarizing-by-Classification
The official implementation of paper "Binarizing by Classification: Is soft function really necessary?"
coder2gwy
互联网首份程序员考公指南,由3位已经进入体制内的前大厂程序员联合献上。
DGQ
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
DLMC-QUANT
a model quantization tool
Tengine
Tengine is a lite, high performance, modular inference engine for embedded device
KIVI
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
ilur98's Repositories
ilur98/DGQ
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
ilur98/DLMC-QUANT
a model quantization tool
ilur98/Binarizing-by-Classification
The official implementation of paper "Binarizing by Classification: Is soft function really necessary?"
ilur98/coder2gwy
互联网首份程序员考公指南,由3位已经进入体制内的前大厂程序员联合献上。
ilur98/Tengine
Tengine is a lite, high performance, modular inference engine for embedded device