/WebCanvas_showcase

Showcase connecting representative agent frameworks to online environment evaluation with WebCanvas framework.

Primary LanguagePythonMIT LicenseMIT

Project Name

Showcase connecting representative agent frameworks to online environment evaluation with WebCanvas framework.

To-Do List

  • SEEACT1
  • Tree Search for Language Model Agents2
  • tarsier3

Leave an issue or create a pull request to support your agent. We welcome any feedback!

References

Footnotes

  1. Zheng, Boyuan, et al. "Gpt-4v (ision) is a generalist web agent, if grounded." arXiv preprint arXiv:2401.01614 (2024).

  2. Koh, Jing Yu, et al. "Tree Search for Language Model Agents." arXiv preprint arXiv:2407.01476 (2024).

  3. https://github.com/reworkd/tarsier