/BGEMM-CUDA

This repository implements Binary General Matrix Multiply (BGEMM) with a custom CUDA kernel. Thanks to FP6-LLM for the wheels!

Primary language: CUDA. License: Apache License 2.0 (Apache-2.0).
