mesolitica/NLP-Models-Tensorflow

AssertionError: assert not np.isnan(cost). How to solve this NAN error. Thanks in advance.

ViswanathaReddyGajjala opened this issue · 8 comments

AssertionError: assert not np.isnan(cost). How to solve this NAN error. Thanks in advance.

Which notebook is this?

https://github.com/huseinzol05/NLP-Models-Tensorflow/blob/master/text-classification/65.gpt-2.ipynb

The notebook is for binary classification. I have modified the code for multi-class classification. But, the accuracy is around 55.5 . By using BERT, I am getting around 75. Is it ok to have such a large margin between those two models? Thanks in advance.

I am also wondering why GPT-2 is not really powerful, maybe my code is not really a good in structure, I might need to check the GPT-2 inside tensorboard

Have you trained the model in an unsupervised manner?

can you elaborate more about unsupervised manner?

Are you using the pre-trained model released by OpenAI?

nope, I just took the model code, and train it from scratch. Maybe you can try to do and let me know the result.

Hi, GPT-2 from OpenAI is not full data they release, as they say, maybe that is the answer