microsoft/mscclpp

[feature] `sm_channel.put` can only copy from global memory

saeedmaleki opened this issue · 0 comments

@abhijangda found a performance problem with the `put` function, where we enforce the use of assembly that loads from global memory. This is not ideal, as the user's data might be in shared memory or registers.
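To make the limitation concrete, here is a minimal standalone sketch. It is not the mscclpp implementation: `loadGlobalOnly`, `loadGeneric`, and `copyKernel` are hypothetical names made up for illustration. It contrasts a load pinned to the global state space through inline PTX (`ld.global`) with a plain dereference, which the compiler can lower to a load that works for whatever address space the pointer refers to.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for a copy primitive that hard-codes a global load.
// `ld.global` is only valid when `src` points into global memory.
__device__ __forceinline__ int loadGlobalOnly(const int* src) {
  int v;
  asm volatile("ld.global.s32 %0, [%1];" : "=r"(v) : "l"(src) : "memory");
  return v;
}

// Plain dereference: valid for global, shared, or local memory.
__device__ __forceinline__ int loadGeneric(const int* src) { return *src; }

__global__ void copyKernel(const int* globalSrc, int* dst) {
  __shared__ int staged[256];
  int i = threadIdx.x;
  // Source is in global memory, so the global-only load is fine here.
  staged[i] = loadGlobalOnly(globalSrc + i) + 1;
  // Source is now in shared memory: only the generic load is valid.
  // Each thread reads back the element it wrote, so no barrier is needed.
  dst[i] = loadGeneric(&staged[i]);
}

int main() {
  const int n = 256;
  int host[n];
  for (int i = 0; i < n; ++i) host[i] = i;

  int *src = nullptr, *dst = nullptr;
  cudaMalloc(&src, n * sizeof(int));
  cudaMalloc(&dst, n * sizeof(int));
  cudaMemcpy(src, host, n * sizeof(int), cudaMemcpyHostToDevice);

  copyKernel<<<1, n>>>(src, dst);
  cudaMemcpy(host, dst, n * sizeof(int), cudaMemcpyDeviceToHost);

  // Every element should have been incremented exactly once.
  int errors = 0;
  for (int i = 0; i < n; ++i) errors += (host[i] != i + 1);
  printf("mismatches: %d\n", errors);

  cudaFree(src);
  cudaFree(dst);
  return errors != 0;
}
```

Passing `&staged[i]` to `loadGlobalOnly` instead would be incorrect, which is the restriction this issue is about. The sketch should build with `nvcc` as a single `.cu` file.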

@abhijangda is looking for a clean solution.