SumanthRH/cuda-resource-stream

CUDA related news and material links

MIT

CUDA MODE Resource Stream

Here you will find a collection of CUDA-related material (books, papers, blog posts, YouTube videos, tweets, implementations, etc.). We also collect information on higher-level tools for performance optimization and kernel development, such as Triton and torch.compile() ... whatever makes the GPUs go brrrr.

Do you know a great resource we should add? Please see How to contribute.

1st Contact with CUDA

An Easy Introduction to CUDA C and C++
CUDA Toolkit Documentation
Basic terminology (thread block, warp, streaming multiprocessor): Wiki: Thread Block, A tour of CUDA
GPU Performance Background User's Guide
OLCF NVIDIA CUDA Training Series (talk recordings can be found under the presentation footer for each lecture); exercises
GTC 2022 - CUDA: New Features and Beyond - Stephen Jones

2nd Contact

CUDA Refresher

Papers, Case Studies

Books

Tri Dao Fan Section

Practice

Sasha Rush's GPU Puzzles

PyTorch Highlights

Code / Libs

NVIDIA/cutlass

Essentials

Profiling

Nsight Compute Profiling Guide
mcarilli/nsight.sh - Favorite nsight systems profiling commands for PyTorch scripts
Profiling GPU Applications with Nsight Systems

News

SemiAnalysis

Technical Blog Posts

Cooperative Groups: Flexible CUDA Thread Programming

Hardware Architecture

How to contribute

To share interesting CUDA-related links, please create a pull request for this file. See editing files in the GitHub documentation.

Or contact us on the CUDA MODE Discord server: https://discord.gg/jqYdBWreqb