try newly released `cudaLaunchCooperativeKernelMultiDevice()` in CUDA C++
Primary LanguageCudaMIT LicenseMIT