Pinned Repositories
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
lectures
Material for gpu-mode lectures
ppl.pmx
ppq
pytorch_quantization
A pytorch implementation of dorefa quantization
ppl.pmx
ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
ppq_tools
Jzz24's Repositories
Jzz24/pytorch_quantization
A pytorch implementation of dorefa quantization
Jzz24/lectures
Material for gpu-mode lectures
Jzz24/ppl.pmx
Jzz24/ppq