cuda-cpp
There are 12 repositories under cuda-cpp topic.
xlite-dev/LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
NVIDIA/cccl
CUDA Core Compute Libraries
MuGdxy/muda
μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updating.
facebookresearch/CUTracer
A dynamic binary instrumentation tool for tracing and analyzing CUDA kernel instructions.
rbga/CUDA-Merge-and-Bitonic-Sort
Efficient implementations of Merge Sort and Bitonic Sort algorithms using CUDA for GPU parallel processing, resulting in accelerated sorting of large arrays. Includes both CPU and GPU versions, along with a performance comparison.
GPUEngineering/GPUtils
A C++ header-only library for parallel linear algebra on GPUs (CUDA/cuBLAS under the hood)
zheliu137/cusolver_stream
test stream of cusolver
Shikha-code36/CUDA-Programming-Beginner-Guide
A beginner's guide to CUDA programming
Bhargavoza1/cuda_neural_network
learning to develop lightning fast C++/CUDA neural network
MohammadDallash/cuda-cpp-programming
This repo contains some CUDA C++ code examples that demonstrate how to use GPUs for parallel computing. Covering topics such as dynamic parallelization, Optimization, ....etc
zheliu137/GPU_Perf_UnitTest
Test the GPU performance on Linear Algebra Operations. Compare the results with CPP/Fortran
FlosMume/cpp-cuda-starter
CUDA C/C++ starter template for Windows 11 + WSL2 (RTX 4070 SUPER tested)