low-precision
There are 5 repositories under low-precision topic.
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Tiiiger/QPyTorch
Low Precision Arithmetic Simulation in PyTorch
gudovskiy/ShiftCNN
A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation
sefaburakokcu/quantized-yolov5
Low Precision(quantized) Yolov5
gudovskiy/fmap_compression
Code for DNN feature map compression paper