computer-using-agent

There are 9 repositories under computer-using-agent topic.

  • a-real-ai/pywinassistant

    The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user-interfaces (GUIs) by using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning to elicit spatial reasoning and perception, emulates, plans and simulates synthetic HID interactions.

    Language:Python1.3k3216187
  • OS-Agent-Survey/OS-Agent-Survey

    This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

  • OS-Copilot/ScienceBoard

    Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"

    Language:Python1069
  • reidbarber/webmarker

    Mark web pages for use with vision-language models

    Language:TypeScript44253
  • ahnjaewoo/FlashAdventure

    🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"

    Language:Python13
  • Clad3815/open-computer-use

    AI-powered assistant that controls a Windows environment through docker, allowing automated interaction with the desktop interface. Control your computer with natural language.

    Language:JavaScript11
  • ercbot/valk

    Simple, observable computer use - Remote desktop for AI agents

    Language:Rust4100
  • thethinkmachine/4o-agent

    A ReAct Principles based fully autonomous Command Line Computer Using Agent

    Language:Python00
  • Mihonarium/food_ordering_agent

    Use an LLM agent to automate ordering food and other items from Deliveroo, Uber Eats, DoorDash, etc.

    Language:Python101