/cutlass_fpA_intB_gemm

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

Primary LanguageC++Apache License 2.0Apache-2.0

No issues in this repository yet.