- Python
sounddevice
for recording audio until you stop speaking - Whisper API for transcribing audio
- OpenAI TTS for speech
- PyWinCtl and pyautogui for screenshots of a specific window
- OpenAI Vision API to process the screenshot and answer your prompt
python -m venv venv
. venv/bin/activate
pip install -r requirements.txt
python main.py
All project-wide settings are in settings.py.