Output repeats until the maximum output length is reached
zestaken commented
When I run Openchat-3.5-0106 locally, it keeps repeating the output content until the maximum output length is reached. In other words, the model's output does not stop on its own.
The model also loads with the following warning:
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
May I ask how to solve this kind of problem?
Cancerxy commented
I also encountered this problem.
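One possible workaround, offered as a sketch rather than a confirmed fix: cut the generated text at the model's end-of-turn marker after generation. This assumes the marker is the literal string `<|end_of_turn|>` (the token OpenChat 3.5 uses in its chat template) and that it shows up in the decoded text; `truncate_at_stop` is a hypothetical helper name, not part of any library. If the model is loaded through Hugging Face transformers, passing the id of that token as `eos_token_id` to `generate` may stop generation earlier instead, but that has not been verified against this setup.

```python
def truncate_at_stop(text, stop_strings=("<|end_of_turn|>",)):
    """Return `text` cut at the earliest occurrence of any stop string.

    Hypothetical helper: the default stop string is an assumption about
    the end-of-turn marker used by the model's chat template.
    """
    cut = len(text)
    for stop in stop_strings:
        idx = text.find(stop)
        if idx != -1:
            # Keep only the text before the earliest stop string found.
            cut = min(cut, idx)
    return text[:cut]


# Made-up example of a decoded output that repeats after the first turn ends.
raw = "Hello there!<|end_of_turn|>Hello there!<|end_of_turn|>Hello there!"
print(truncate_at_stop(raw))
```

This only trims the text after the fact; the model still generates up to the maximum length, so it does not save any compute.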