tmc-cuda: A C++ repository from tzcnt

tmc-cuda

This is where the planned TooManyCooks - CUDA integration will be. There's nothing here yet...

Provide an awaitable that wraps a CUDA Graph (by attaching a callback at the end of the graph via cudaGraphAddHostNode
Investigate if CUDA provides other methods of async submission / notification.
See if priority can be used to control access to the GPU.