Replacing address formats with 64-bit pointers
Opened this issue · 0 comments
Nicolas-Iskos commented
Given that CUDA now supports shuffle instructions with 64-bit values, would it be possible to replace the 32-bit address formats with full 64 bit pointers within each slab to eliminate the translation cost?