cublaslt
There are 5 repositories under cublaslt topic.
Bruce-Lee-LY/cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
nghiapq77/face-recognition-cpp-tensorrt
Face Recognition with RetinaFace and ArcFace.
Bruce-Lee-LY/cutlass_gemm
Multiple GEMM operators are constructed with cutlass to support LLM inference.
zhaocc1106/cuxx-programing
cuda、cublas、cublaslt、cusparse...
vadimkantorov/fastmlp
[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP