Unable to train the model

Question

Unable to train the model

Opened this issue 7 years ago · 2 comments

I tried training your model for around 20,000 iterations on a GPU but when I tested it on the testing data and the unseen data, it is always printing "I don't know".Can someone please help me out?

Answer 1 · 2018-01-12T00:36:46.000Z

You've faced a common problem in seq2seq learning. Based on my experience,it's probably because the learning rate is too small for the model to get out of the 'trap' of same response.Try to use bigger learning rate like 0.2 or even 1.

Answer 2 · 2020-06-18T11:54:20.000Z

What is the code to train the model.