Generation gets interrupted
MAKESEB commented
rjmacarthy commented
Please check the max tokens setting for chat and set it to -1 for infinite. The model may be sending an EOS token, or the request from the server may have ended.
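For reference, a minimal sketch of the equivalent request made directly against Ollama, assuming the default `/api/chat` endpoint on port 11434 and that the chat max-token setting maps to Ollama's `num_predict` option (the model name is only an example):

```typescript
// Minimal sketch: send a chat request to a local Ollama server with
// num_predict set to -1 so generation is not cut off by a token limit.
// Assumes the default Ollama address; "codellama" is just an example model.
async function chatWithoutTokenLimit(prompt: string): Promise<void> {
  const response = await fetch("http://localhost:11434/api/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "codellama",
      stream: false,
      messages: [{ role: "user", content: prompt }],
      options: {
        num_predict: -1 // -1 lets generation run until the model emits an EOS/stop token
      }
    })
  })
  const data = await response.json()
  console.log(data.message?.content)
}

chatWithoutTokenLimit("Explain how HTTP keep-alive works, in detail.")
```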
MAKESEB commented
rjmacarthy commented
Hmm, not sure, I'll have to check. Does it happen in the terminal when calling Ollama with the same options? You can enable debug mode in settings to see the request and options.
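One way to test this outside the editor is to stream the same request directly at the Ollama API and watch where the output stops. A rough sketch, assuming Node 18+ (global fetch), the default Ollama address, and options copied from the debug output (model name is an example):

```typescript
// Rough reproduction sketch: stream a chat request straight to Ollama and
// print each chunk, so it is visible whether the server ends the response
// early or the model simply emits an EOS token.
async function streamAndWatch(): Promise<void> {
  const response = await fetch("http://localhost:11434/api/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "codellama",
      stream: true,
      messages: [{ role: "user", content: "Explain async iterators in detail." }],
      options: { num_predict: -1, temperature: 0.2 }
    })
  })
  const decoder = new TextDecoder()
  // Each streamed line is a JSON object; the final one has "done": true.
  for await (const chunk of response.body as unknown as AsyncIterable<Uint8Array>) {
    process.stdout.write(decoder.decode(chunk, { stream: true }))
  }
  console.log("\n-- stream closed --")
}

streamAndWatch()
```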
rjmacarthy commented
Hey @MAKESEB, in 3.7.0 I updated to the new version of the Ollama API, which supports the OpenAI specification. It may help with this issue; please report back.
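For context, a sketch of a request against the OpenAI-compatible endpoint that recent Ollama versions expose, again assuming the default local address and an example model name; under this spec, omitting `max_tokens` leaves the response length uncapped and `finish_reason` reports why generation stopped:

```typescript
// Sketch of a request against Ollama's OpenAI-compatible endpoint.
// Assumes the default local address; "codellama" is only an example model.
async function openAiStyleChat(): Promise<void> {
  const response = await fetch("http://localhost:11434/v1/chat/completions", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "codellama",
      messages: [{ role: "user", content: "Summarise what this issue is about." }]
      // no max_tokens: generation ends only when the model produces a stop/EOS token
    })
  })
  const data = await response.json()
  const choice = data.choices[0]
  // finish_reason is "stop" for a natural end and "length" if a token limit was hit
  console.log(choice.message.content, choice.finish_reason)
}

openAiStyleChat()
```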
Smrkc commented
Updating twinny and setting all token limits to -1 solves the incomplete generation issue for me. Thanks
rjmacarthy commented
Thanks!