CUB is a flexible library of cooperative threadblock primitives and other utilities for CUDA kernel programming.
Primary LanguageCudaBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause