Kevin-XiongC's Stars
DefTruth/CUDA-Learn-Notes
🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.
mit-han-lab/parallel-computing-tutorial
ztxz16/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
Evian-Zhang/llvm-ir-tutorial
LLVM IR入门指南
l1nkr/DL-Compiler-Navigation
Machine Learning Compiler Road Map
xprayc/link-load-library-code
<程序员的自我修养> 源代码
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
jiangjiajun/PaddleUtils
Some tools to operate PaddlePaddle model
oshino29/ngaArchive
nga论坛帖子的存档
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
kimlongli/FiveChess
博弈能力不错的五子棋AI
tsinghua-rll/VoxelNet-tensorflow
A 3D object detection system for autonomous driving.