dalel487/su2hmc

Convert Conjugate gradient to single precision

Closed this issue · 3 comments

We need to copy the device float array x2_f into the managed double array x2. Because this is CUDA a simple x2_f[i]=(float)x2[i] won't cut it. One option is to change x2_f to be managed too. Need to see where the managed array is used though and who that'll affect memory performance.

So far so good for congradq. Still need to work on congradp

Removing bug label for now as Congradp does calculate correctly in DP

We may need to look at the CUDA version of Congradp again at a later date