NVIDIA/stdexec

Investigate potential `defer_temp_storage_destruction` optimization

gevtushenko opened this issue · 0 comments

Memory deallocation in `defer_temp_storage_destruction` might be blocking. We should investigate whether launching all the destruction kernels in sequence (or fusing the destructors into a single kernel) before releasing the memory performs any better. This would address the following TODO:

// TODO Bury all the memory associated with the stream provider and then 
//      deallocate the memory
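
For discussion, here is a minimal host-only sketch of the ordering the TODO describes: run every pending destructor first, then release all the memory in a second pass. It is not the stdexec implementation. `temp_object`, `deferred_queue`, `emplace`, and `drain` are hypothetical names, the destructor call stands in for a destruction kernel launch, and `::operator delete` stands in for the (possibly blocking) device deallocation.

```cpp
#include <memory>
#include <new>
#include <string>
#include <vector>

// Hypothetical stand-in for an object living in temporary storage.
// Its destructor records that it ran, so the ordering can be inspected.
struct temp_object {
  std::vector<std::string>* log;
  int id;
  ~temp_object() { log->push_back("destroy " + std::to_string(id)); }
};

// Hypothetical deferred-destruction queue (assumption, not the stdexec type).
class deferred_queue {
  std::vector<temp_object*> pending_;
  std::vector<std::string>* log_;

public:
  explicit deferred_queue(std::vector<std::string>* log) : log_(log) {}

  // Allocate raw storage, construct the object in place, and remember it
  // so that destruction can be deferred until drain().
  temp_object* emplace(int id) {
    void* raw = ::operator new(sizeof(temp_object));
    auto* obj = ::new (raw) temp_object{log_, id};
    pending_.push_back(obj);
    return obj;
  }

  void drain() {
    // Phase 1: run all destructors back to back. In a GPU version this is
    // where the destruction kernels (or one fused kernel) would be launched.
    for (temp_object* p : pending_) {
      std::destroy_at(p);
    }
    // Phase 2: only now release the memory, so a blocking deallocation
    // never sits between two destructor launches.
    for (temp_object* p : pending_) {
      ::operator delete(p);
      log_->push_back("free");
    }
    pending_.clear();
  }
};
```

With two queued objects, `drain()` logs both `destroy` entries before any `free` entry. Whether this ordering actually helps on the device depends on how much the deallocation call serializes the stream, which is what needs measuring.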