About CLIP_GRADIENTS VALUE setting

Question

haiasd opened this issue 2 years ago · 1 comments

I find CLIP_VALUE 0.01 in mask2former training setting, is there any reason for this setup. I think it‘s a little small.

Answer 1 · 2023-02-13T14:53:53.000Z

I have the same question. But it seems to make little difference in convergence rate even though the clip_value is so small.