tbennun opened this issue 3 years ago · 1 comment
Otherwise, parallelism opportunities are missed as blockDim.x is usually small on its own.
Will close once we port the daceml-samples material.