jinzhen-lin/cutlass_fpA_intB_gemm

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

C++Apache-2.0

Readme
0Issues
0Stargazers
0Watchers

No issues in this repository yet.

Contact site admin: Geeks.