Ieremie opened this issue 3 years ago · 0 comments
The paper does not mention the use of Batch Normalization in the case of the audio task.
In the case of the Vision task, it mentions that '' We did not use Batch-Norm [38]."