MrNeRF/gaussian-splatting-cuda

Does __fmaf really help?

Closed this issue · 1 comments

I see you replace basically every operation with this intrinsics, but according to this there is zero speedup, they serve only for predictable rounding.

Yes, I've had the same experience. I tried it mainly because the NVIDIA CUDA profiler kept suggesting these operations to enhance speed. However, in terms of speed, it doesn't seem to be beneficial at all. Instead, it tends to obfuscate and makes the code more difficult to understand.