marimo-team/marimo

Langchain/Langgraph Agent Integration

Opened this issue · 6 comments

Description

My usecase is that I have a LangChain/Langgraph agent that I would like to import and use in marimo.

I understand mo.ai enables llm access, and mo.ui.chat renders a chat box to interface with an llm, with the capability to use tools even currently. I also understand that AI cells are currently possible to write python/sql code.

I think marimo specifically is missing the opportunity to give a unique interface with LLM agents. Instead of rendering a chat window, I'd like to create a "chat" cell in the same way that a languageadapter is built for constructing md/sql cells.

In a data exploration context, this could be very useful to make each llm input+output its own cell. Using mo.ui.chat is insufficient, because data exploration may happen in a sequential order like a chat session, but the "findings" may need to be rearranged to present a synthesized argument, which is why the "notebook" experience is more useful. Similarly, the AI cells are great and quite a bit closer to what would work, but it falls short in that the user query is not saved.

Suggested solution

I would love code that looks something along the lines of

# agent registration for marimo
@app.cell
def __():
  from my_agent_file import my_agent
  mo.ai.register_agent(my_agent(), "MyAgentName") # maybe needs third param to tell method for invoking model? if it differs across agent frameworks

# now, can use in future AI cells, where using the AI cell instead of showing just ["Python", "SQL"] shows ["Python", "SQL", "MyAgentName"]
# but this could also be like a new type of "Agent" or "Chat" cell instead of "AI" or something
# The language adapter for this will essentially use the invoke method for the registered agent:
@app.cell
def __():
  MY_QUERY = "Get the US Census data for 2020 and make a geographic map colored by average age"
  mo.ai.agent("MyAgentName").invoke(MY_QUERY)

# This gives output that shows the streamed agent output thoughts, tool calls and graphs
# ideally, the fetched data would be stored as a df var also automatically in marimo

# I can then have further cells asking questions about the data

Alternative

No response

Additional context

I'm happy to help with implementing this if we can establish a path forward!

I'm currently in the process of resolving package dependencies for the latest LangChain/Graph versions by helping build Pyodide/WASM compatible versions, but i think the lower langchain 0.2 / langgraph 0.1 versions work for now

Thanks for sharing your use-case and example. It would be great to try to scope out what this cell might look like and why it might need to exist (what are the gaps between the Python cell). What would the inputs be (text, images, etc)? and what would the outputs be (markdown, structured logs, charts etc?)

For example, I can get pretty far with 2 cells (instead of 1) to invoke and run an agent. Would this work for a proof-of-concept? Could you share the agent you built with langchain so I can try to understand what further features could be supported?

Image

import marimo

app = marimo.App(width="medium")

@app.cell
def __():
    from agents import my_agent
    return (my_agent,)

@app.cell
def __():
    import marimo as mo
    input = mo.ui.text_area(placeholder="Query").form(bordered=False)
    input
    return input, mo

@app.cell
def __(input, my_agent):
    my_agent(input.value)
    return

if __name__ == "__main__":
    app.run()

Image

In my case, the only input is text for now. Images could be interesting. The outputs in my case can be anything, I'm using mo.output.append() to stream objects out, and the mime rendering of whatever objects are coming out, generally md or dataframes, but i also render genome browsers via the pygv anywidget widget.

2 cells feels clunky for this use case unfortunately if that makes sense?

It's mostly a polish thing for (1) not have to have users use await agent.run() and (2) have it display more chat-like / more of a natural flow especially when in the display mode. It feels like an easy case for a language adapter, or would be nice if I could just hook in my own language adapter if that's possible?

hey @riyavsinha, i am curious how this has evolved over time. have the assumptions changed (only text output)? is there anything you can demo? if its just a polish thing, maybe we keep it as-is until we can discover some changes to the assumption or new requests?

Hi! I made some changes in a fork to support this for myself for now. Essentially, I have a mo.ai.agents.register_agent() method, where I can register agent(s). Then, a "Chat" option in the new cell array can be used to add a cell with this boilerplate:

Image

And when running that in the "Agent" language adapter mode it just looks like:

Image

The output is not text only; its text, dfs, different anywidget widgets like pygv, etc that I specify in my langgraph agent tool output as a response metadata

I also have a method to register DFs to the global context, though I understand that's not something marimo would add since it reduces the ability to statically analyze the code and do reactivity etc

wow this is really awesome to see. thank you for sharing, I'll check out your fork

Sounds good, feel free to message with questions also! the current code is quite hacky. At the simplest level right now you can register any callable eg like an echo agent with mo.ai.agents.register_agent(lambda x: x) to test the UI without setting up an actual langchain agent