This is where the planned TooManyCooks CUDA integration will live. Nothing is implemented yet. Planned work:
- Provide an awaitable that wraps a CUDA Graph, by attaching a completion callback at the end of the graph via cudaGraphAddHostNode.
- Investigate whether CUDA provides other methods of async submission / notification (for example cudaLaunchHostFunc on a stream, or polling an event with cudaEventQuery).
- See whether priority can be used to control access to the GPU, for example by mapping it to CUDA stream priorities (cudaStreamCreateWithPriority).
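
As a rough illustration of the first item, here is a minimal sketch of the callback half only, written in plain C++ so it builds without the CUDA toolkit. Everything in it is an assumption and not existing TooManyCooks code: `graph_completion`, `fake_graph_launch`, and `demo` are hypothetical names. The only CUDA fact it relies on is that a host node's callback (`cudaHostFn_t`, set through the `fn` and `userData` fields of `cudaHostNodeParams`) is a plain function pointer taking a single `void*`. The graph launch is replaced by a stand-in that calls the callback inline.

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Same shape as CUDA's host-function callback type (cudaHostFn_t):
// a plain function pointer taking one opaque user-data pointer.
using host_fn_t = void (*)(void*);

// Hypothetical completion state. In a real awaitable this would hold the
// suspended coroutine handle and the executor it should be resumed on.
struct graph_completion {
    std::function<void()> continuation;  // runs once the graph has finished
    bool done = false;

    // Trampoline: recovers the typed state from the opaque pointer.
    // A real integration should only enqueue work onto an executor here,
    // because CUDA host functions must not call the CUDA API themselves.
    static void on_graph_done(void* user_data) {
        auto* self = static_cast<graph_completion*>(user_data);
        self->done = true;
        if (self->continuation) {
            self->continuation();
        }
    }
};

// ASSUMPTION: stand-in for launching a graph whose final node is a host
// node. It calls the callback inline; CUDA would call it later, from a
// thread owned by the runtime, after the preceding graph nodes complete.
inline void fake_graph_launch(host_fn_t fn, void* user_data) {
    fn(user_data);
}

// Hypothetical usage: returns how many times the continuation ran.
inline int demo() {
    int resumed = 0;
    graph_completion state;
    state.continuation = [&resumed] { ++resumed; };
    fake_graph_launch(&graph_completion::on_graph_done, &state);
    return resumed;
}
```

In an actual implementation, `&graph_completion::on_graph_done` and the state pointer would presumably go into a `cudaHostNodeParams` passed to cudaGraphAddHostNode, and the state would need to outlive the graph launch.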