computer-using-agent
There are 9 repositories under computer-using-agent topic.
a-real-ai/pywinassistant
The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user-interfaces (GUIs) by using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning to elicit spatial reasoning and perception, emulates, plans and simulates synthetic HID interactions.
OS-Agent-Survey/OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
OS-Copilot/ScienceBoard
Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
reidbarber/webmarker
Mark web pages for use with vision-language models
ahnjaewoo/FlashAdventure
🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"
Clad3815/open-computer-use
AI-powered assistant that controls a Windows environment through docker, allowing automated interaction with the desktop interface. Control your computer with natural language.
ercbot/valk
Simple, observable computer use - Remote desktop for AI agents
thethinkmachine/4o-agent
A ReAct Principles based fully autonomous Command Line Computer Using Agent
Mihonarium/food_ordering_agent
Use an LLM agent to automate ordering food and other items from Deliveroo, Uber Eats, DoorDash, etc.