low-precision

There are 5 repositories under low-precision topic.

intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Language:Python2k 34 187245
Tiiiger/QPyTorch
Low Precision Arithmetic Simulation in PyTorch
Language:Python256 12 5271
gudovskiy/ShiftCNN
A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation
Language:Python55 4 517
sefaburakokcu/quantized-yolov5
Low Precision(quantized) Yolov5
Language:Python28 2 97
gudovskiy/fmap_compression
Code for DNN feature map compression paper
Language:C++11 4 13