Pinned Repositories
awesome-ui-agents
A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)
boyugou
GitHub README
boyugou.github.io
Boyu Gou's Homepage
GUI-Agents-Paper-List
Building a comprehensive and handy list of papers for GUI agents
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
llava_uground
Mind2Web
Dataset, code and models for the paper "Mind2Web: Towards a Generalist Agent for the Web".
Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
UGround
Official Repo for UGround
boyugou's Repositories
boyugou/GUI-Agents-Paper-List
Building a comprehensive and handy list of papers for GUI agents
boyugou/llava_uground
boyugou/Mind2Web
Dataset, code and models for the paper "Mind2Web: Towards a Generalist Agent for the Web".
boyugou/awesome-ui-agents
A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)
boyugou/boyugou
GitHub README
boyugou/boyugou.github.io
Boyu Gou's Homepage
boyugou/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
boyugou/OpenDevin
🐚 OpenDevin: Code Less, Make More
boyugou/WebCanvas
Connect agents to live web environments evaluation.