adjidieng/ETM

getting nan as loss for Short reviews

Mandark27 opened this issue · 2 comments

Is it applicable for short reviews? Minimum how many words must be there in a review for the model to run excluding the stopwords.

I am getting nan as loss since my output tensor from q_theta is a tensor full of nan.

Yes it works for any document size. You must have not have prepared your dataset in the right format or you might have chosen a large learning rate. Also if you choose ReLU as the activation for the inference network for q_theta, make sure you normalize the bag of words input by setting the option bow_norm to 1.

We just added the scripts to pre-process a dataset to the repo. Please check that out and let us know if you still have questions.