NaN when width=channel=1, B0

Question

NaN when width=channel=1, B0

charlesxjyang opened this issue 5 years ago · 2 comments

I am using this on my own computer vision dataset. My image size is small enough that when the image size ends up being (batch_size,n_channels,1,1), the EvoNorm begins returning NaN's. I know the reason is because the width=channel=1 because when I make my network smaller with less convolution layers i.e. width,channel>1, the NaN's go away. Is there any reason why this is the case?

Answer 1 · 2020-04-26T16:51:55.000Z

The NaN's are coming from the instance_std and torch.max returns NaN when any of it's elements are NaN. I just added a simple check to see if instance_std was NaN and just return (var+self.eps).sqrt(). The instance_std is most likely NaN because width=channel=1 has no instance variance e.g. variance of a constant is not well-defined in torch, is my guess. I'd be happy to submit a pull request to fix this.

Answer 2 · 2020-04-26T18:11:33.000Z

Hi. Thanks for raising the issue and providing a solution for it. Please submit a PR.