/multimodal-gamer

A framework to enable multimodal models to play games on a computer.

Primary LanguagePythonMIT LicenseMIT

Multimodal Gamer

A framework to enable multimodal models to play games on a computer.

Using the same inputs and outputs as a human operator, the model plays a game!

Key Features

Demo of Multimodal Gamer playing on Chess.com

readme.mp4

Quick install

  1. Create a virtual environment:
    python3 -m venv env
  2. Activate the virtual environment (Mac/Linux):
    source env/bin/activate
  3. Install the required packages:
    pip install -r requirements.txt
  4. Add your OpenAI API key:
    export OPENAI_API_KEY=yourkeyhere

Running the Game

To run a specific game, use the -game flag followed by the name of the game. For example, to play chess:

python main.py --game chess