
Add the possibility to 'skip' layers

aromanro opened this issue · 1 comments

This would alleviate the vanishing gradient problem for deeper networks, and it's fairly easy to implement, either 'residual' style (as in ResNet), where a previous layer's output is added to the current layer's output, or by simply concatenating the output of a previous layer with the output of the current layer, with both being fed into the next layer's input.
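To make the two variants concrete, here is a minimal sketch in plain Python. It is not taken from this repository: the function names (`dense`, `residual_skip`, `concat_skip`) and the choice of a tanh activation are assumptions made only for illustration.

```python
import math

def dense(x, weights, biases):
    """Hypothetical fully connected layer: y = tanh(W x + b)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, biases)]

def residual_skip(x, weights, biases):
    """ResNet-style skip: add the layer's input to its output.
    Requires the layer's output size to equal its input size."""
    y = dense(x, weights, biases)
    return [yi + xi for yi, xi in zip(y, x)]

def concat_skip(x, weights, biases):
    """Concatenation-style skip: append the layer's input to its output.
    The next layer then needs len(output) + len(x) inputs."""
    return dense(x, weights, biases) + x

x = [0.5, -1.0]
W = [[0.0, 0.0], [0.0, 0.0]]  # all-zero weights, so dense(x) is all zeros
b = [0.0, 0.0]

print(residual_skip(x, W, b))  # [0.5, -1.0]: the input passes through unchanged
print(concat_skip(x, W, b))    # [0.0, 0.0, 0.5, -1.0]: next layer sees 4 values
```

With all-zero weights the residual variant reduces to the identity, which is why gradients can still flow through a layer that has not learned anything yet. The concatenation variant instead widens the input of the next layer.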

This can be taken quite far, as for example in DenseNet, where within a dense block each layer receives the outputs of all preceding layers as its input.

I probably won't do it.
With the current code, it would be more like DenseNet, and that makes the number of parameters grow quickly.
Unless I also implement convolutional networks, I think it's not worth it.
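A rough way to see the parameter growth is to count the weights and biases. The sketch below assumes a fully connected network in which every layer receives the concatenation of the network input and all earlier layer outputs; that connectivity pattern, the layer sizes, and the helper names `plain_params` and `dense_concat_params` are assumptions for illustration, not something from the current code.

```python
def plain_params(sizes):
    """Weights + biases of a plain fully connected network.
    sizes = [input, hidden..., output]."""
    return sum((n_in + 1) * n_out for n_in, n_out in zip(sizes, sizes[1:]))

def dense_concat_params(sizes):
    """Same layer sizes, but each layer's input is the concatenation of
    the network input and the outputs of all earlier layers."""
    total, seen = 0, sizes[0]
    for n_out in sizes[1:]:
        total += (seen + 1) * n_out  # weights for everything seen so far, plus biases
        seen += n_out                # this layer's output is appended for later layers
    return total

sizes = [100] * 6  # input of 100, four hidden layers of 100, output of 100
print(plain_params(sizes))         # 50500
print(dense_concat_params(sizes))  # 150500
```

For layers of equal width the plain count grows linearly with depth, while the concatenated count grows roughly quadratically, since each new layer has to weight everything that came before it.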