[New Feature]: expose callbacks and ainvoke
Closed this issue · 6 comments
Checked for duplicates
- Yes
Alternatives considered
- Yes - and alternatives don't suffice
Related problems
- No, see below
Describe the feature request
For the GUI to stay responsive, deferring long-running tasks is essential. At the moment it is not possible to get intermediate updates out of ROSA or to react to errors. It would be nice if the ROSA constructor accepted a `callbacks` parameter to pass on to its AgentExecutor. Exposing the `ainvoke` method would be useful, too.
As an aside, I'm not sure if adding double underscores to all class members is really necessary. It makes inheritance and monkey patching much more annoying :)
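To make the request concrete, here is a minimal sketch of the proposed API shape. Everything below is hypothetical: `Rosa` and `StubAgentExecutor` are stand-ins for the real classes, not ROSA's actual implementation.

```python
class StubAgentExecutor:
    """Stand-in for LangChain's AgentExecutor so the sketch runs standalone."""

    def __init__(self, callbacks=None):
        self.callbacks = callbacks or []

    async def ainvoke(self, inputs):
        # The real executor would run the agent loop here.
        return {"output": f"echo: {inputs['input']}"}


class Rosa:
    """Hypothetical ROSA wrapper that forwards callbacks to its executor."""

    def __init__(self, callbacks=None):
        # Forward user-supplied callbacks straight to the AgentExecutor.
        self.__executor = StubAgentExecutor(callbacks=callbacks)

    async def ainvoke(self, query: str):
        # Exposing the executor's async entry point, as requested.
        return await self.__executor.ainvoke({"input": query})
```

A GUI could then run `await Rosa(callbacks=[...]).ainvoke("...")` inside its event loop instead of blocking a thread. (Incidentally, the double underscore means consumers must reach through the mangled name `_Rosa__executor` to patch anything, which illustrates the aside above.)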
Can you elaborate on the intended use case a bit more? I'd like to understand how the callbacks would be used to keep the UI responsive.
Is it that you want to be able to submit arbitrarily many queries to the agent without waiting for the previous query to finish?
How will you keep track of the queries, their intermediate steps, and results?
Mostly just curious, I think it's a good idea either way.
Sure! Right now I'm running the request inside a thread and updating the UI from there once it returns. There are a couple of things I can't do with this approach (at least as far as I can tell):
- stream responses
- give user feedback about what tools are used
- react to failed tool runs
- cancel response generation
From what I saw, LangChain uses a callback class whose member functions handle response updates. I would construct one as I submit the request, then use a closure over the request and its generated UI elements to keep the association.
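Roughly like this, as a sketch: the handler below is a stand-in mirroring a few of LangChain's `BaseCallbackHandler` hook names (`on_llm_new_token`, `on_tool_start`, `on_tool_error`), and the dict stands in for a real UI widget.

```python
class UiCallbackHandler:
    """Stand-in mirroring a few BaseCallbackHandler hooks from LangChain."""

    def __init__(self, on_token, on_tool, on_error):
        self._on_token = on_token
        self._on_tool = on_tool
        self._on_error = on_error

    def on_llm_new_token(self, token, **kwargs):
        self._on_token(token)

    def on_tool_start(self, serialized, input_str, **kwargs):
        self._on_tool(serialized.get("name", "?"))

    def on_tool_error(self, error, **kwargs):
        self._on_error(error)


def make_handler_for(ui_label):
    # Closure over the UI element generated for this particular request,
    # keeping the request/widget association without any global state.
    def show_token(token):
        ui_label["text"] = ui_label.get("text", "") + token

    def show_tool(name):
        ui_label["status"] = f"running tool: {name}"

    def show_error(err):
        ui_label["status"] = f"tool failed: {err}"

    return UiCallbackHandler(show_token, show_tool, show_error)
```

Each submitted query would get its own handler via `make_handler_for(widget)`, passed along with the request.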
Check out the `astream` implementation in `rosa.py` on the `feature/streaming` branch. It uses LangChain's `astream_events` method, which provides access to streaming tokens, tool usage, and more. Also see the `turtle_agent` demo script for example usage.
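A consumer loop over such an event stream might look like the following sketch. The fake generator stands in for `agent_executor.astream_events(...)`; the event dict shapes are an assumption modeled on LangChain's documented event format, not taken from ROSA's code.

```python
import asyncio


async def fake_event_stream():
    # Stand-in for agent_executor.astream_events(...). The "event" keys
    # below follow LangChain's event naming (assumed for this sketch).
    yield {"event": "on_tool_start", "name": "spawn_turtle", "data": {}}
    for chunk in ("Spawned ", "a ", "turtle."):
        yield {"event": "on_chat_model_stream", "data": {"chunk": chunk}}


async def consume(stream, on_token, on_tool):
    # Dispatch each event type to the matching UI update callback.
    async for event in stream:
        if event["event"] == "on_chat_model_stream":
            on_token(event["data"]["chunk"])
        elif event["event"] == "on_tool_start":
            on_tool(event["name"])


tokens, tools = [], []
asyncio.run(consume(fake_event_stream(), tokens.append, tools.append))
```

This covers streaming tokens and tool-usage feedback from the earlier list in one loop.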
One thing I noticed, and have yet to find a solution to, is that LangChain's `AgentExecutor` cannot be pre-empted or cancelled with standard LangChain facilities. This will likely require the use of `asyncio` tasks. What is unclear to me at the moment is whether ROSA should provide that integration, or whether it should be left to the consumer (I am leaning toward the latter).
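On the consumer side, the `asyncio` pattern would look roughly like this sketch: wrap the agent call in a task and cancel it from the GUI. Note the caveat above still applies: cancelling the task stops the consumer awaiting it, but may not cleanly abort work inside LangChain itself. `slow_agent` below is a stand-in for an actual `ainvoke` call.

```python
import asyncio


async def main():
    async def slow_agent(query):
        # Stand-in for ROSA's ainvoke; simulates a long-running agent call.
        await asyncio.sleep(10)
        return f"answer to {query}"

    # Wrap the call in a task so it can be cancelled, e.g. from a
    # "stop" button handler in the GUI.
    task = asyncio.create_task(slow_agent("hello"))
    await asyncio.sleep(0)  # let the task start
    task.cancel()
    try:
        return await task
    except asyncio.CancelledError:
        return "cancelled"


result = asyncio.run(main())
```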
Thank you! I'll have some time next week to test it out. I agree, though, that supporting request cancellation sounds like something LangChain should provide.
Tried it out just now and works like a charm! The example was already very helpful. Seems like you had some fun with it :D Thank you!