junrushao/cutlass_fpA_intB_gemm
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
C++Apache-2.0
No issues in this repository yet.
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
C++Apache-2.0
No issues in this repository yet.