fp16 performance
sunacc opened this issue · 1 comments
sunacc commented
Hi dear developer,
I see that fp16 is now supported in NGT, and I wonder how it performs compared to fp32.
Since memory usage is halved, can we theoretically expect 2x performance?
masajiro commented
With fp16 support, memory usage is halved, as you mentioned, but distances are still calculated in fp32: the stored fp16 values are converted to fp32 with SIMD before the distance is computed.
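
To illustrate the store-narrow, compute-wide pattern described above, here is a minimal sketch in Python using only the standard library. It is not NGT's code: NGT performs the conversion in C++ with SIMD, whereas this example uses `struct` with the `e` (IEEE 754 half-precision) format, and Python's arithmetic runs in double precision rather than fp32. The helper names (`pack_fp16`, `unpack_fp16`, `l2_distance`) and the sample vectors are hypothetical, chosen only for illustration.

```python
import math
import struct


def pack_fp16(vec):
    """Store a vector as half-precision values (2 bytes per element)."""
    return struct.pack(f"<{len(vec)}e", *vec)


def pack_fp32(vec):
    """Store a vector as single-precision values (4 bytes per element)."""
    return struct.pack(f"<{len(vec)}f", *vec)


def unpack_fp16(buf):
    """Widen stored fp16 values back to Python floats before any arithmetic."""
    return struct.unpack(f"<{len(buf) // 2}e", buf)


def l2_distance(a_fp16, b_fp16):
    """L2 distance between two fp16-encoded vectors.

    The subtraction and accumulation happen on the widened values,
    not in half precision, so only storage is narrowed.
    """
    a = unpack_fp16(a_fp16)
    b = unpack_fp16(b_fp16)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical sample data.
query = [0.1, 0.2, 0.3, 0.4]
item = [0.4, 0.3, 0.2, 0.1]

stored_query = pack_fp16(query)
stored_item = pack_fp16(item)

# Storage is halved: 16 bytes in fp32 versus 8 bytes in fp16.
print(len(pack_fp32(item)), len(stored_item))

# The distance is computed after widening, so the arithmetic cost per
# element is not reduced, and a conversion step is added on top.
print(l2_distance(stored_query, stored_item))
```

In this sketch the number of arithmetic operations per distance is the same as for fp32 storage; what changes is the number of bytes read per vector. Whether that yields any speedup in practice would depend on how memory-bound the search is, which is something to measure rather than assume.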