mk-minchul/AdaFace

Is it worth using the --use_16bit flag? Doesn't it hurt the model's performance?

martinenkoEduard opened this issue · 3 comments

afm215 commented

Theoretically, the 16-bit flag lowers the computational precision from 32 bits to 16 bits, which implies some loss in accuracy. In practice, I haven't seen a noticeable accuracy impact when training in 16 bits, but it may depend on the task you are dealing with. From a computational-resources point of view, it's another story: training in float16 cuts memory (VRAM) consumption, allowing you to fit models larger than what you could store in 32 bits. Also, on recent GPUs, 16-bit arithmetic will very likely speed up computation, making your training faster (the improvement can reach a factor of 2). Be aware that this speedup does not apply to every GPU. I would advise you to check on TechPowerUp whether your GPU benefits from 16-bit computation (if it doesn't, a large drop in throughput is to be expected).
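If you want to see what this looks like at the code level, here is a minimal sketch of mixed-precision training with PyTorch's torch.cuda.amp. This illustrates the general technique only; the repo itself may wire up --use_16bit differently (e.g. through PyTorch Lightning's precision setting), and the model/data here are dummies:

```python
import torch
import torch.nn as nn

device = "cuda"

# Quick local check: GPUs with compute capability >= 7.0 (Volta and newer)
# have tensor cores and usually see the large float16 speedup.
print(torch.cuda.get_device_capability())

# Dummy model, optimizer, and loss, just to make the loop runnable.
model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10)).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()

# GradScaler rescales the loss to avoid float16 gradient underflow.
scaler = torch.cuda.amp.GradScaler()

for step in range(100):
    x = torch.randn(32, 512, device=device)          # dummy batch
    y = torch.randint(0, 10, (32,), device=device)   # dummy labels

    optimizer.zero_grad(set_to_none=True)
    # autocast runs matmuls/convolutions in float16 while keeping
    # precision-sensitive ops (e.g. reductions) in float32.
    with torch.cuda.amp.autocast():
        loss = criterion(model(x), y)

    scaler.scale(loss).backward()   # backward pass on the scaled loss
    scaler.step(optimizer)          # unscales gradients, then steps
    scaler.update()                 # adjusts the scale factor for the next step
```

Note that in this mixed-precision setup the master weights stay in float32; only selected ops run in float16, which is part of why the accuracy impact is usually small.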

Will it be the same for training and inference alike?
And if I train a model using 16-bit precision, will I be able to run inference in 32-bit precision?