- fast float to half on CPU side
- matrix transpose experiment
- warp shuffle tricks
- matrix multiply experiment
- convolution experiment, CUDA version of Winograd convolution