/cuda

CUDA learning examples

Primary Language: C++

CUDA learning notes

TODO

  • fast float-to-half conversion on the CPU side

  • matrix transpose experiment

  • warp shuffle tricks

  • matrix multiply experiment

  • convolution experiment, CUDA version of Winograd convolution
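
For the float-to-half item in the list above, a plain reference conversion can serve as a correctness baseline before any fast variant is written. The following is a minimal sketch, not the repository's implementation: the function name `float_to_half` is hypothetical, and it assumes IEEE 754 binary32 input, binary16 output, and round-to-nearest-even.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical reference helper (not from this repository):
// converts an IEEE 754 binary32 value to binary16 bits,
// rounding to nearest, ties to even.
std::uint16_t float_to_half(float f) {
    std::uint32_t x;
    std::memcpy(&x, &f, sizeof x);            // bit copy without aliasing issues
    const std::uint32_t sign = (x >> 16) & 0x8000u;
    const std::uint32_t abs = x & 0x7FFFFFFFu;

    if (abs > 0x7F800000u) {                   // NaN: return a quiet NaN
        return static_cast<std::uint16_t>(sign | 0x7E00u);
    }
    if (abs >= 0x47800000u) {                  // >= 65536, including infinity
        return static_cast<std::uint16_t>(sign | 0x7C00u);
    }
    if (abs >= 0x38800000u) {                  // normal half range (>= 2^-14)
        // Rebias the exponent (127 -> 15) and drop 13 mantissa bits.
        std::uint32_t h = (abs - 0x38000000u) >> 13;
        const std::uint32_t rem = abs & 0x1FFFu;
        if (rem > 0x1000u || (rem == 0x1000u && (h & 1u))) {
            ++h;                               // a carry may correctly reach infinity
        }
        return static_cast<std::uint16_t>(sign | h);
    }
    const std::uint32_t exp = abs >> 23;
    if (exp < 102u) {                          // below 2^-25: rounds to zero
        return static_cast<std::uint16_t>(sign);
    }
    // Subnormal half: shift the mantissa with its implicit leading one.
    const std::uint32_t shift = 126u - exp;    // in the range 14..24
    const std::uint32_t m = (abs & 0x7FFFFFu) | 0x800000u;
    std::uint32_t h = m >> shift;
    const std::uint32_t rem = m & ((1u << shift) - 1u);
    const std::uint32_t halfway = 1u << (shift - 1u);
    if (rem > halfway || (rem == halfway && (h & 1u))) {
        ++h;
    }
    return static_cast<std::uint16_t>(sign | h);
}
```

This version favors readability over speed; a faster variant (table-driven or SIMD, for example) could be checked against it value by value.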

reduce sum
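
As a rough illustration of the pattern that GPU reduce-sum kernels commonly use, here is a host-side C++ sketch of a pairwise (tree) reduction. It is an assumption about the general technique, not code from this repository: the name `tree_reduce_sum` is hypothetical, and the inner loop stands in for what would be one thread per index on a GPU, with a barrier such as `__syncthreads()` between strides.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not from this repository): sums `data` using the
// halving-stride pattern typical of a shared-memory block reduction.
float tree_reduce_sum(std::vector<float> data) {
    if (data.empty()) {
        return 0.0f;
    }
    // Pad with zeros to the next power of two so every stride pairs up cleanly.
    std::size_t n = 1;
    while (n < data.size()) {
        n <<= 1;
    }
    data.resize(n, 0.0f);

    // Each pass halves the number of active elements.
    for (std::size_t stride = n / 2; stride > 0; stride >>= 1) {
        // On a GPU, each value of i would be handled by a separate thread,
        // and a barrier would follow this loop body.
        for (std::size_t i = 0; i < stride; ++i) {
            data[i] += data[i + stride];
        }
    }
    return data[0];
}
```

For example, `tree_reduce_sum({1.0f, 2.0f, 3.0f, 4.0f, 5.0f})` pads the input to eight elements and reduces it in three passes.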


tensor core

simple