gpu f16 cast to fp32 calculation, and then converted back?

Question

gpu f16 cast to fp32 calculation, and then converted back?

Opened this issue 5 months ago · 1 comments

For elementwise operations with fp16 input, the data is first converted to fp32, and convert back after call gpu functions? But gpu actually support fp16 and bf16.
bool cast_result_to_fp16 = false;

Answer 1 · 2024-04-30T09:50:20.000Z

besides, does xla support tf32 now?