SRU
Closed this issue · 3 comments
Neoanarika commented
Implementing SRU u first because it is easier and will speed up training
Neoanarika commented
Implemented SRU on the SRU branch, build ontop of the weight visualisation branch.
Neoanarika commented
However the SRU don't have batch norm which the paper uses with it's LSTM
Neoanarika commented
Ok implemented Batch normalisation