iusztinpaul/hands-on-llms

The Trained model is not providing any response

bhadreshpsavani opened this issue · 3 comments

I trained model end to end and once i deployed it on gradio for chat assistant, It was giving empty reponse for all the query

Hello,

Have you trained it for the same number of epochs as in our config or more?

Hello @iusztinpaul,

I trained it with the same number of epochs.

I was getting charts like this on comet.
Screenshot_20240103-080306_Chrome.png

We had the same behavior when training for too many epochs. This is probably because there is not enough variety in the dataset, and the model overfits.

Here are 3 ways to debug this issue:

  1. Try to use our model to run the inference to see that everything works fine
  2. Lower the number of epochs
  3. If lowering the number of epochs doesn't work, try also to lower the learning rate.

Let me know if this helps.