/multimodal-gamer

A framework to enable multimodal models to play games on a computer.

Primary LanguagePythonMIT LicenseMIT

Multimodal Gamer

A framework to enable multimodal models to play games on a computer.

Using the same inputs and outputs as a human operator, the model plays a game!

Key Features

  • Compatibility: Designed for various multimodal models.
  • Integration: Currently integrated with GPT-4
  • Future Plans: Support for additional models.

Quick install

Create venv

python3 -m venv env

Activate it (Mac)

source env/bin/activate

Install requirements

pip install -r requirements.txt

Add OpenAI Key

export OPENAI_API_KEY=yourkeyhere

Run it

python main.py