Pinned Repositories
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
acm
competitive programming templates
cutlass
CUDA Templates for Linear Algebra Subroutines
docker_devenv
Docker development environment (Ubuntu)
docker_vim_ycm
vim with ycm compiled
Huawei-CodeCraft-2019
Huawei Software Elite Challenge 2019 (CodeCraft), Hang-Xia (杭夏) division, team 咕咕咕, finals champion
matxscript
A high-performance, extensible Python AOT compiler.
kongroo's Repositories
kongroo/Huawei-CodeCraft-2019
Huawei Software Elite Challenge 2019 (CodeCraft), Hang-Xia (杭夏) division, team 咕咕咕, finals champion
kongroo/acm
competitive programming templates
kongroo/docker_devenv
Docker development environment (Ubuntu)
kongroo/cutlass
CUDA Templates for Linear Algebra Subroutines
kongroo/docker_vim_ycm
vim with ycm compiled
kongroo/matxscript
A high-performance, extensible Python AOT compiler.
kongroo/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators