Tools for per-layer quantization: fp32, fp16, and int8 via PTQ and QAT (int4 not yet implemented)
Primary Language: Python