Company: SenseTime
Location: Beijing
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
A PyTorch implementation of DoReFa quantization.
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
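To make the DoReFa entry above more concrete, here is a minimal sketch of the k-bit weight quantizer as it is usually described for DoReFa-Net: weights are squashed with tanh, rescaled into [0, 1], rounded to 2^k evenly spaced levels, and mapped back to [-1, 1]. This is not code from the listed repository. The function names `quantize_k` and `dorefa_quantize_weights` are hypothetical, the sketch uses only the Python standard library rather than PyTorch, and it leaves out the straight-through estimator that training would need for gradients.

```python
import math


def quantize_k(r, k):
    """Round r (assumed to lie in [0, 1]) to one of 2**k evenly spaced levels."""
    n = 2 ** k - 1
    # Note: Python's round() rounds exact halves to the nearest even integer.
    return round(r * n) / n


def dorefa_quantize_weights(weights, k):
    """Illustrative k-bit DoReFa-style weight quantization (forward pass only).

    Assumes k > 1; the 1-bit case is normally handled with a separate
    sign-based rule and is not covered here.
    """
    squashed = [math.tanh(w) for w in weights]
    # Guard against an all-zero input to avoid dividing by zero.
    max_abs = max(abs(v) for v in squashed) or 1.0
    quantized = []
    for v in squashed:
        r = v / (2.0 * max_abs) + 0.5          # rescale into [0, 1]
        quantized.append(2.0 * quantize_k(r, k) - 1.0)  # map back to [-1, 1]
    return quantized


if __name__ == "__main__":
    example = [-1.2, -0.3, 0.0, 0.4, 1.2]
    print(dorefa_quantize_weights(example, k=2))
```

With k = 2 the output can take at most four distinct values, and the largest-magnitude weights land on the endpoints -1 and 1.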