Training two-layer networks fails with gradient errors
mihaelacr-google opened this issue · 2 comments
mihaelacr-google commented
I have added a createBidi2 method to the CLSTMText class, but training two-layer networks fails with a gradient assertion error, regardless of the learning rate and hidden layer sizes used.
My code to create the two-layer network is here.
The error I am getting is:
clstmfiltertrain: clstm.cc:642: void ocropus::GenericNPLSTM<F, G, H>::backward() [with int F = 1; int G = 2; int H = 2]: Assertion `!anynan(out)' failed.
It looks like there is an error when computing the gradient of two-layer networks.
abhigarg commented
I am facing the same problem.
lorenzob commented
In case this helps someone: I got the exact same error when I ran clstmocrtrain without the corresponding *.gt.txt files in the image folder.
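A quick way to rule this cause out before training is to list the images that have no ground truth file next to them. The following is only a minimal sketch, not part of clstm: the function name `find_missing_ground_truth` and the `*.png` glob are made up for illustration, and it assumes that the ground truth for an image such as `0001.bin.png` is expected at `0001.gt.txt` in the same folder (the file name up to its first dot, plus `.gt.txt`). Adjust the pairing rule if your data is laid out differently.

```python
from pathlib import Path


def find_missing_ground_truth(image_dir, image_glob="*.png"):
    """Return the images in image_dir that have no matching .gt.txt file.

    Hypothetical helper, not part of clstm.
    """
    missing = []
    for image in sorted(Path(image_dir).glob(image_glob)):
        # Assumption: the stem is the file name up to its first dot,
        # so "0001.bin.png" pairs with "0001.gt.txt".
        stem = image.name.split(".", 1)[0]
        if not (image.parent / f"{stem}.gt.txt").is_file():
            missing.append(image)
    return missing


if __name__ == "__main__":
    import sys

    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    for image in find_missing_ground_truth(folder):
        print("missing ground truth for", image)
```

Run it with the image folder as the only argument. If it prints nothing, every image found by the glob has a ground truth file and the assertion probably has a different cause.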