`kaiming_uniform_` will lead to `nan` for RealNVP

Question

`kaiming_uniform_` will lead to `nan` for RealNVP

Closed this issue 3 years ago · 1 comments

The output of RealNVP can be very high. This is because it have torch.exp is the forward process:

y = x * torch.exp(self.logs(x)) + self.t(s)

Is there any method that do all requirement below at the same time:

avoid very large number
avoid very small number

Answer 1 · 2021-05-23T01:43:08.000Z

This seems be a issue off RealNVP, instead of a code bug. To solve this problem, I've added a soft clip option for RealNVP. Any number x that |x|>clip will become clip by using a function like clip * tanh(x / clip). However, this will not perfectly solve the problem for very deep models.