PyTorch extension for quantization with highly efficient CUDA kernels
Primary language: CUDA. License: MIT.