[feature] `sm_channel.put` can only copy from global memory
saeedmaleki opened this issue · 0 comments
saeedmaleki commented
@abhijangda found a performance problem with the `put` function: we enforce the use of assembly that loads the source data from global memory. This is not ideal, as the user might have the data in shared memory or registers.
@abhijangda is looking for a clean solution.
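
For illustration, here is a minimal sketch of the kind of restriction described above. It is not the actual `sm_channel.put` code: `load_global_only`, `load_generic`, `copy_words`, and the `SrcIsGlobal` template parameter are hypothetical names, and the specific PTX instruction is an assumption about what a forced global load might look like. The point is only that a load tagged with the `.global` state space is valid for global-memory addresses alone, while a plain dereference lets the compiler emit a load that also works for shared or local memory.

```cpp
#include <cstddef>
#include <cstdint>

// Compile as plain C++ on the host or as CUDA with nvcc.
#if defined(__CUDACC__)
#define SKETCH_DEVICE __host__ __device__ inline
#else
#define SKETCH_DEVICE inline
#endif

// Hypothetical helper: a load that assumes `p` points to global memory.
SKETCH_DEVICE uint64_t load_global_only(const uint64_t* p) {
#if defined(__CUDA_ARCH__)
  uint64_t v;
  // The `.global` qualifier restricts this load to the global state space;
  // passing a shared-memory address here would be incorrect.
  asm volatile("ld.volatile.global.u64 %0, [%1];" : "=l"(v) : "l"(p) : "memory");
  return v;
#else
  return *p;  // Host fallback so the sketch builds without nvcc.
#endif
}

// Hypothetical helper: a generic load. The compiler chooses the instruction,
// so the source may live in global, shared, or local memory.
SKETCH_DEVICE uint64_t load_generic(const uint64_t* p) { return *p; }

// Hypothetical copy loop in which the caller states at compile time whether
// the source is known to be in global memory. Each thread copies a strided
// subset of the `n` words.
template <bool SrcIsGlobal>
SKETCH_DEVICE void copy_words(uint64_t* dst, const uint64_t* src, size_t n,
                              size_t tid, size_t nthreads) {
  for (size_t i = tid; i < n; i += nthreads) {
    dst[i] = SrcIsGlobal ? load_global_only(src + i) : load_generic(src + i);
  }
}
```

With something along these lines, a caller whose data is in shared memory could use `copy_words<false>(...)`, while existing callers keep the global-memory path with `copy_words<true>(...)`. Whether this is the right shape for the fix is an open question.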