Offload matrix value reordering to device.
greole opened this issue · 1 comments
Currently, the matrix values are reordered serially on the host. This should be avoided by offloading the reordering to the device.
TODO:
- Test which of the following is cheaper:
  - creating the COO matrix from unsorted elements on the device and converting it to CSR there, or
  - continuing to sort the elements before creating the COO matrix.
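
To make the two options concrete, here is a minimal sketch of the data flow. It is not the project's implementation. It assumes the sparsity pattern stays fixed between updates, so the sort permutation can be computed once and reused. All function names (`sort_permutation`, `gather`, `coo_rows_to_csr_row_ptrs`) are hypothetical, and plain Python stands in for what would be device kernels.

```python
def sort_permutation(rows, cols):
    """Return perm so that unsorted entry perm[i] is the i-th entry in row-major order.

    Assumption: the sparsity pattern is fixed, so this is computed once and cached.
    """
    return sorted(range(len(rows)), key=lambda k: (rows[k], cols[k]))


def gather(values, perm):
    """Reorder values with a cached permutation.

    Each output element reads exactly one input element, so on a device
    this would map to one independent thread per index.
    """
    return [values[p] for p in perm]


def coo_rows_to_csr_row_ptrs(sorted_rows, n_rows):
    """Build CSR row pointers from row-sorted COO row indices."""
    row_ptrs = [0] * (n_rows + 1)
    for r in sorted_rows:          # count entries per row
        row_ptrs[r + 1] += 1
    for i in range(n_rows):        # prefix sum turns the counts into offsets
        row_ptrs[i + 1] += row_ptrs[i]
    return row_ptrs


# Small 3x3 example with entries given in arbitrary order.
rows = [1, 0, 2, 0, 1]
cols = [1, 0, 2, 1, 0]
values = [4.0, 1.0, 5.0, 2.0, 3.0]

perm = sort_permutation(rows, cols)            # done once per sparsity pattern
sorted_values = gather(values, perm)           # repeated for every value update
row_ptrs = coo_rows_to_csr_row_ptrs(gather(rows, perm), n_rows=3)
```

With a cached permutation, only the `gather` step has to run on every update, and it has no dependencies between elements. Whether that beats building an unsorted COO matrix on the device and letting the conversion to CSR do the sorting is exactly what the TODO above needs to measure.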