microsoft/kernel-memory

TextGenerationOptions is totally not used

AsakusaRinne opened this issue · 9 comments

TextGenerationOptions is a parameter of ITextGeneration.GenerateTextAsync, but it currently does not appear to be used anywhere.

For API services such as OpenAI ChatGPT, a stop sequence is not that important. For local model inference, however, the model will generate output endlessly without one.
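
For context, here is a minimal, hypothetical sketch of why this matters for a local backend (simplified stand-in types, not the actual Kernel Memory interfaces): without a token cap and stop sequences, the generation loop never terminates.

```csharp
// Hypothetical, simplified sketch (not the Kernel Memory API): a local backend
// needs StopSequences and MaxTokens passed through, otherwise the loop below
// keeps emitting tokens forever.
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;

public sealed record LocalGenerationOptions(int MaxTokens, IReadOnlyList<string> StopSequences);

public static class LocalGenerationSketch
{
    // nextToken stands in for the local model: given the text so far, return the next token.
    public static string Generate(Func<string, string> nextToken, string prompt, LocalGenerationOptions options)
    {
        var output = new StringBuilder();

        for (int i = 0; i < options.MaxTokens; i++) // hard cap on generated tokens
        {
            output.Append(nextToken(prompt + output));

            string text = output.ToString();
            string? hit = options.StopSequences.FirstOrDefault(s => text.Contains(s, StringComparison.Ordinal));
            if (hit is not null)
            {
                // Stop sequence reached: return the text up to (excluding) the stop sequence.
                return text[..text.IndexOf(hit, StringComparison.Ordinal)];
            }
        }

        return output.ToString(); // MaxTokens reached
    }
}
```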

Could you please expose TextGenerationOptions through the AskAsync API so that users can configure these settings themselves? That would help a lot with local LLM inference integration.

If possible, I would also like the token-counting method to be customizable.
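
To illustrate the token-counting request, here is a hypothetical sketch of what a pluggable counter could look like; none of these names come from Kernel Memory, they only show the shape of the feature being asked for.

```csharp
// Hypothetical sketch only: letting callers plug in their own token-counting
// function instead of a hard-coded one. These names are not from Kernel Memory.
using System;

public interface ITokenCounter
{
    int CountTokens(string text);
}

// Naive whitespace-based counter; a real implementation would wrap the model's own tokenizer.
public sealed class WhitespaceTokenCounter : ITokenCounter
{
    public int CountTokens(string text) =>
        text.Split(' ', StringSplitOptions.RemoveEmptyEntries).Length;
}
```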

Any updates? @dluc I understand that in the early stages of a project there is always a shortage of hands, but please at least let us know whether this will be addressed in the future.

dluc commented

Sorry, we haven't had a chance to look into this yet, but we keep an eye on the list of open issues and will provide an update as soon as possible.

Ok, I'm looking forward to it. Thank you for your work anyway.

dluc commented

I noticed that LLama would generate tokens almost ad infinitum (at some point it throws an exception). SearchClientConfig.AnswerTokens will be passed as TextGenerationOptions.MaxTokens.
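
In other words, the configured AnswerTokens value ends up as TextGenerationOptions.MaxTokens on each generation call. A hedged sketch of that mapping (not the actual SearchClient source; the streaming signature of GenerateTextAsync, the settable MaxTokens property, and the namespace are assumptions that may differ by version):

```csharp
// Hedged sketch of the mapping described above, not the actual SearchClient code.
// Assumes GenerateTextAsync streams tokens and TextGenerationOptions.MaxTokens is settable.
using System.Text;
using System.Threading;
using System.Threading.Tasks;
using Microsoft.KernelMemory.AI; // namespace approximate, may differ by version

public static class AnswerGenerationSketch
{
    public static async Task<string> GenerateAnswerAsync(
        ITextGeneration generator, string prompt, int answerTokens, CancellationToken ct = default)
    {
        var options = new TextGenerationOptions
        {
            MaxTokens = answerTokens // taken from SearchClientConfig.AnswerTokens
        };

        var answer = new StringBuilder();
        await foreach (string token in generator.GenerateTextAsync(prompt, options, ct))
        {
            answer.Append(token);
        }

        return answer.ToString();
    }
}
```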

I'll look into adding the options to the Ask API, so the behavior can be managed more easily.

Thanks a lot! I'm looking forward to it.

Update: @marcominerva added new settings to SearchClientConfig (see #341), allowing Temperature, TopP and other LLM request settings to be configured. These will soon be used by AskAsync; PR coming soon.
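
For illustration, a hedged sketch of how these settings might be supplied. Only Temperature, TopP and AnswerTokens are named in this thread; the WithSearchClientConfig builder method and the exact property shapes are assumptions, so check #341 for the actual surface.

```csharp
// Sketch under assumptions: configuring the LLM request settings mentioned above
// via SearchClientConfig. Builder method and property names may differ by version.
using Microsoft.KernelMemory;

var memory = new KernelMemoryBuilder()
    .WithSearchClientConfig(new SearchClientConfig
    {
        AnswerTokens = 300, // forwarded as TextGenerationOptions.MaxTokens
        Temperature  = 0,   // deterministic answers
        TopP         = 1
    })
    // ... text generation / embedding / storage configuration elided ...
    .Build<MemoryServerless>();

var answer = await memory.AskAsync("What does TextGenerationOptions control?");
```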

Fixed: see #341 and #344

@dluc Thank you, Devis! We'll keep track of the next release and apply it in LLamaSharp.