imoneoi/openchat

repeat the output content until the maximum output length is set

Opened this issue · 2 comments

Using Openchat-3.5-0106 locally will repeat the output content until the maximum output length is set. In other words, the output of the model does not stop automatically.
And the model is loaded with the following warning:
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
May I ask how to solve this kind of problem?

Hi @zestaken, can you provide more information about your local model setup?

I also encountered this problem