junrushao/cutlass_fpA_intB_gemm
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
C++ · Apache-2.0