dleebrown opened this issue 7 years ago · 0 comments
The way the weights are initialized is incorrect, compared with the scheme in He (2015). This doesn't impact the network too much since it's pretty shallow, but it needs to be fixed.