An-AI-Adventure

Speaks what you are doing using gpt4-vision

Primary language: Python

Setup

Clone this repo, then set up and activate a virtualenv:

python3 -m pip install virtualenv
python3 -m virtualenv venv
source venv/bin/activate

Then, install the dependencies: pip install -r requirements.txt

Create Replicate, OpenAI, and ElevenLabs accounts, then set your tokens:

export OPENAI_API_KEY=<token>
export ELEVENLABS_API_KEY=<eleven-token>

Create a new voice in ElevenLabs and get its voice ID, either through their get voices API or by clicking the flask icon next to the voice in the VoiceLab tab.
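If you prefer the API route, the lookup can be sketched roughly as follows. This is not part of the repo: the helper names (`parse_voices`, `fetch_voices`, `main`) are hypothetical, and the endpoint URL, the `xi-api-key` header, and the `voices` / `voice_id` / `name` response fields are assumptions based on the public ElevenLabs API reference, so check them against the current docs before relying on this.

```python
import json
import os
import urllib.request

# Assumed endpoint for the ElevenLabs "get voices" API.
VOICES_URL = "https://api.elevenlabs.io/v1/voices"


def parse_voices(payload):
    """Map voice name -> voice_id from a get-voices JSON response body."""
    data = json.loads(payload)
    return {voice["name"]: voice["voice_id"] for voice in data.get("voices", [])}


def fetch_voices(api_key):
    """Call the API (network access required) and return name -> voice_id."""
    # Assumed auth scheme: the API key is sent in the "xi-api-key" header.
    request = urllib.request.Request(VOICES_URL, headers={"xi-api-key": api_key})
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_voices(response.read().decode("utf-8"))


def main():
    """Print every voice ID next to its name, using ELEVENLABS_API_KEY."""
    voices = fetch_voices(os.environ["ELEVENLABS_API_KEY"])
    for name, voice_id in voices.items():
        print(f"{voice_id}  {name}")
```

Calling `main()` from a Python shell inside the activated virtualenv should list your voices. Copy the ID of the one you created into the export below.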

export ELEVENLABS_VOICE_ID=<voice-id>
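Before running anything, it can help to confirm that all three variables are visible to Python. A minimal sketch, assuming the scripts read exactly the variable names exported above; `missing_vars` and `report` are hypothetical helpers, not functions from this repo:

```python
import os

# The variable names exported in the setup steps above.
REQUIRED_VARS = ("OPENAI_API_KEY", "ELEVENLABS_API_KEY", "ELEVENLABS_VOICE_ID")


def missing_vars(environ=os.environ):
    """Return the required variable names that are unset or empty."""
    return [name for name in REQUIRED_VARS if not environ.get(name)]


def report(environ=os.environ):
    """Print a one-line summary of the environment check."""
    missing = missing_vars(environ)
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("All required environment variables are set.")
```

Run `report()` in the same terminal session where you ran the exports, since variables set with `export` only apply to that shell and its child processes.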

Run it!

In one terminal, run the webcam capture:

python look.py

In another terminal, run the narrator:

python speak.py