tensorcore
There are 12 repositories under the tensorcore topic.
Zhen-Dong/HAWQ
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
wmmae/wmma_extension
An extension library for the WMMA API (Tensor Core API)
enp1s0/ozIMMU
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
YukeWang96/QGTC_PPoPP22
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
enp1s0/cuMpSGEMM
Fast SGEMM emulation on Tensor Cores
robbwu/tensorsvm
Fast kernel SVM on TensorCore-enabled GPUs
wmmae/hmma.f32.f32
An extension library for the WMMA API for single-precision matrix operations using TensorCores and an error-correction technique
wmmae/mma.simt
A software TensorCore using warp shuffle
ShaoKAi100812/CudaCore_TensorCore_Acceleration
Compares the runtime of CNN computation on CPU and GPU
YukeWang96/APNN-TC_SC21
Artifact for SC21: APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores.
eshibusawa/Simple-Examples
Simple examples of tools and libraries
hinofafa/torch_accelerator
Experiments in accelerating PyTorch training on GPU devices