facebookresearch/Mask2Former

About CLIP_GRADIENTS VALUE setting

haiasd opened this issue · 1 comment

I found CLIP_VALUE set to 0.01 in the Mask2Former training settings. Is there any reason for this setup? It seems a little small to me.

I have the same question. That said, it seems to make little difference to the convergence rate, even though the clip_value is so small.
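
One possible explanation for the small effect, offered as a sketch rather than a confirmed answer: if the optimizer is Adam-like and the clip value acts as a maximum gradient norm (both are assumptions, neither is stated in this thread), then a constant rescaling of the gradients barely changes the update, because Adam divides the gradient average by its own running magnitude. The pure-Python toy below uses made-up gradients and a hand-written Adam step (the helper names `clip_grad_norm` and `adam_updates` are hypothetical) to compare a clip threshold of 0.01 against 1.0 when every raw gradient norm exceeds both.

```python
import math

def clip_grad_norm(grad, max_norm):
    """Rescale a gradient vector so its L2 norm is at most max_norm."""
    total_norm = math.sqrt(sum(g * g for g in grad))
    coef = min(1.0, max_norm / (total_norm + 1e-6))
    return [g * coef for g in grad]

def adam_updates(grad_seq, lr=1e-4, beta1=0.9, beta2=0.999, eps=1e-8):
    """Return the per-step parameter updates of a plain Adam optimizer."""
    n = len(grad_seq[0])
    m = [0.0] * n  # running mean of gradients
    v = [0.0] * n  # running mean of squared gradients
    updates = []
    for t, grad in enumerate(grad_seq, start=1):
        step = []
        for i, g in enumerate(grad):
            m[i] = beta1 * m[i] + (1 - beta1) * g
            v[i] = beta2 * v[i] + (1 - beta2) * g * g
            m_hat = m[i] / (1 - beta1 ** t)  # bias correction
            v_hat = v[i] / (1 - beta2 ** t)
            step.append(-lr * m_hat / (math.sqrt(v_hat) + eps))
        updates.append(step)
    return updates

# Made-up gradients for two parameters; every norm is larger than 1.0,
# so both thresholds below clip on every step.
raw_grads = [[3.0, -4.0], [6.0, 8.0], [-1.5, 2.0], [12.0, 5.0]]

small = adam_updates([clip_grad_norm(g, 0.01) for g in raw_grads])
large = adam_updates([clip_grad_norm(g, 1.0) for g in raw_grads])

# Largest relative difference between the two sets of updates.
max_rel_diff = max(
    abs(a - b) / abs(b)
    for s_step, l_step in zip(small, large)
    for a, b in zip(s_step, l_step)
)
print(max_rel_diff)
```

In this toy the two thresholds differ by a factor of 100, yet the updates differ only through the `eps` term. This says nothing about clipping versus no clipping at all, since clipping on every step also discards how the gradient norm changes from step to step.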