/cublasHgemm-P100

Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm

Primary LanguageCudaMIT LicenseMIT

No issues in this repository yet.