agent.exe

Primary language: TypeScript. License: MIT.

Presenting Agent.exe: the easiest way to let Claude's new computer use capabilities take over your computer!

[Demo media: buy pizza, book-flights.mp4]

Motivation

I wanted to see how good Claude's new computer use APIs were, and the default project Anthropic provided felt too heavyweight. This is a simple Electron app that lets Claude 3.5 Sonnet control your local computer directly. I was planning to add a "semi-auto" mode where the user has to confirm each action before it executes, but each step is so slow that I found it wasn't necessary. If the model gets confused, you can just hit the "stop" button to end the run.
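For anyone curious how a "stop" button and an optional "semi-auto" confirmation could fit into a run loop, here is a minimal sketch. It is not taken from the agent.exe source: every name in it (Action, RunOptions, runAgent, nextAction, execute, shouldStop, confirm) is hypothetical, and the model call and the action executor are passed in as plain callbacks.

```typescript
// Hypothetical types and names; none of this comes from the agent.exe source.
type Action = { kind: string; detail: string };

interface RunOptions {
  // Asks the model for the next action; null means the model is done.
  nextAction: (history: Action[]) => Promise<Action | null>;
  // Performs the action on the machine (mouse, keyboard, screenshot, ...).
  execute: (action: Action) => Promise<void>;
  // Polled around every step; wire this to the "stop" button.
  shouldStop: () => boolean;
  // Optional "semi-auto" gate: resolving to false ends the run.
  confirm?: (action: Action) => Promise<boolean>;
}

async function runAgent(opts: RunOptions): Promise<Action[]> {
  const history: Action[] = [];
  while (!opts.shouldStop()) {
    const action = await opts.nextAction(history);
    // The model call is slow, so check the stop flag again before acting.
    if (action === null || opts.shouldStop()) break;
    if (opts.confirm && !(await opts.confirm(action))) break;
    await opts.execute(action);
    history.push(action);
  }
  return history;
}
```

Leaving confirm undefined gives the fully automatic behavior described above, where the only control is the stop flag.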

Getting started

  1. git clone https://github.com/corbt/agent.exe
  2. cd agent.exe
  3. npm install
  4. Rename .env.example to .env and add your Anthropic API key
  5. npm start
  6. Prompt the model to do something interesting on your computer!
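If the app fails to start after step 4, a missing or empty key in .env is a likely cause. The sketch below shows one way to check the contents of a .env file. It assumes the key is stored under the name ANTHROPIC_API_KEY, which is the variable Anthropic's SDK reads by default but may not be the name used in this repo's .env.example. The helpers parseEnv and hasApiKey are hypothetical and only handle simple KEY=value lines.

```typescript
// Assumption: the key lives under ANTHROPIC_API_KEY; check .env.example for the real name.
const KEY_NAME = "ANTHROPIC_API_KEY";

// Parses simple KEY=value lines, skipping blank lines and # comments.
function parseEnv(text: string): Record<string, string> {
  const vars: Record<string, string> = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "" || line.charAt(0) === "#") continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no "=" or empty key: not a usable assignment
    const key = line.slice(0, eq).trim();
    // Strip one pair of matching surrounding quotes, if present.
    const value = line.slice(eq + 1).trim().replace(/^(["'])(.*)\1$/, "$2");
    vars[key] = value;
  }
  return vars;
}

// True when the named variable is present and non-empty.
function hasApiKey(envText: string, name: string = KEY_NAME): boolean {
  const value = parseEnv(envText)[name];
  return value !== undefined && value.length > 0;
}
```

To use it, read the file contents (for example with Node's fs.readFileSync(".env", "utf8")) and pass the resulting string to hasApiKey.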

Supported systems

  • macOS
  • Theoretically Windows and Linux, since all the deps are cross-platform

Known limitations

  • Only works on the primary display
  • Lets an AI completely take over your computer
  • Oh jeez, probably lots of other stuff too

Tips

  • Claude really likes Firefox. It will use other browsers if it absolutely has to, but will behave so much better if you just install Firefox and let it go to its happy place.

Roadmap

  • I literally wrote this in 6 hours, so it probably isn't going anywhere. But I will review PRs and merge them if they seem cool.