junrushao/cutlass_fpA_intB_gemm

A standalone GEMM kernel for fp16 activations and quantized weights, extracted from FasterTransformer.

Language: C++
License: Apache-2.0


