getting nan as loss for Short reviews

Question

getting nan as loss for Short reviews

Mandark27 opened this issue 5 years ago · 2 comments

Is it applicable for short reviews? Minimum how many words must be there in a review for the model to run excluding the stopwords.

I am getting nan as loss since my output tensor from q_theta is a tensor full of nan.

Answer 1 · 2019-11-22T22:47:43.000Z

Yes it works for any document size. You must have not have prepared your dataset in the right format or you might have chosen a large learning rate. Also if you choose ReLU as the activation for the inference network for q_theta, make sure you normalize the bag of words input by setting the option bow_norm to 1.

Answer 2 · 2019-12-08T21:58:57.000Z

We just added the scripts to pre-process a dataset to the repo. Please check that out and let us know if you still have questions.