computer-use-agent
There are 21 repositories under computer-use-agent topic.
magentic-ui
A research prototype of a human-centered web agent
bytebot
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
Agent-S
Agent S: an open agentic framework that uses computers like a human
OpenCUA
OpenCUA: Open Foundations for Computer-Use Agents
SEAgent
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
ScienceBoard
Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
TuriX-CUA
This is the official website for TuriX Computer-use-Agent
TongUI-agent
Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials
os-harm
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
MacOS-Agent
A powerful automation agent for macOS that enables natural language control of various system applications and services. This agent allows you to interact with your Mac using simple text commands, automating tasks across multiple applications including Finder, TextEdit, Preview, and more.
bro
An LLM computer-using agent (CUA) designed to autonomously perform mundane tasks related to business operations and administration, such as doing accounting, filing paperwork, and submitting applications. The accountant is not your bro, but Bro is.
wayland-mcp
MCP Server for Wayland
SUDO
🤖 "sudo rm -rf agentic_security" – Investigating computer-use agent security
docker-knapsack-llm
Computer use Docker Player (LLM Research)
overlay-companion-mcp
A general-purpose, human-in-the-loop AI-assisted screen interaction toolkit.
os-ai-computer-use
AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on Anthropic (Claude) API.
yc-cofounder-bot
Browser automation for YC cofounder matching using OpenAI Computer Use API and Playwright.
CUI-X-FreeAS
下一代MAS(Multi-Agent System)框架,支持MCP,支持无限拓展Agent……详见仓库。
Build_LLM_PROMPT_Agent_computerTool
Innovative new code that leverages Agents SDK and the computer-use-preview openai api model . The user input a query and the app builds the config and JSON to "visually" search the web for products based on the LLM prompt generated. Test code for proof of concept
Claude-GUI
About Mini app where Claude moves the mouse to interact with an HTML page, and uses that interaction to trigger or reflect something in a Flask backend.
Browser-Use-Agent-GUI
Linux GUI for initiating and monitoring Browser Use with an exit switch