Why use BN and GN both in JPU module?
haowang1992 opened this issue · 1 comments
haowang1992 commented
Why use both normalization and what kind of normalization affect the performance most?
wuhuikai commented
Experiments show that employing GN in the specific layers can improve performance.