/webcam-audio-description-ai

Capture webcam video in the browser, generate text descriptions with Google Gemini, and convert them to speech with ElevenLabs.

Primary language: TypeScript

Webcam Audio Description Generator

Generate audio descriptions for your videos using Google Gemini and ElevenLabs.
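The flow described above (webcam frame, then Gemini text, then ElevenLabs speech) can be sketched roughly as the two HTTP requests below. This is an illustration under stated assumptions, not the code in this repository: the helper names (`parseDataUrl`, `buildGeminiRequest`, `buildElevenLabsRequest`), the default model name, and the voice ID are hypothetical. The request shapes follow the public Gemini `generateContent` and ElevenLabs text-to-speech REST endpoints.

```typescript
// Sketch only: helper names, default model, and voice ID are assumptions,
// not taken from this repository.

interface HttpRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Split a canvas data URL ("data:image/jpeg;base64,...") into MIME type and base64 payload.
function parseDataUrl(dataUrl: string): { mimeType: string; base64: string } {
  const match = /^data:([^;,]+);base64,(.*)$/.exec(dataUrl);
  if (!match) throw new Error("expected a base64 data URL");
  return { mimeType: match[1], base64: match[2] };
}

// Build a Gemini generateContent request that asks for a description of one frame.
function buildGeminiRequest(
  apiKey: string,
  frameDataUrl: string,
  prompt: string,
  model: string = "gemini-1.5-flash", // assumed model name
): HttpRequest {
  const { mimeType, base64 } = parseDataUrl(frameDataUrl);
  return {
    url: `https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent`,
    headers: { "Content-Type": "application/json", "x-goog-api-key": apiKey },
    body: JSON.stringify({
      contents: [
        { parts: [{ text: prompt }, { inline_data: { mime_type: mimeType, data: base64 } }] },
      ],
    }),
  };
}

// Build an ElevenLabs text-to-speech request for the generated description.
function buildElevenLabsRequest(apiKey: string, voiceId: string, text: string): HttpRequest {
  return {
    url: `https://api.elevenlabs.io/v1/text-to-speech/${voiceId}`,
    headers: { "Content-Type": "application/json", "xi-api-key": apiKey },
    body: JSON.stringify({ text }),
  };
}
```

Each returned object can be passed to `fetch(req.url, { method: "POST", headers: req.headers, body: req.body })`. Building the requests on the server side (here, a Supabase function) keeps both API keys out of the browser.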

Setup

  • cp supabase/functions/.env.example supabase/functions/.env
  • Set your Gemini API key in supabase/functions/.env
  • Set your ElevenLabs API key in supabase/functions/.env
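Before starting the functions, it can help to confirm that both keys are actually filled in. The helper below is a minimal sketch that parses the text of a `.env` file and reports which required variables are missing or empty. The variable names in the usage note (`GEMINI_API_KEY`, `ELEVENLABS_API_KEY`) are hypothetical; use whatever names `supabase/functions/.env.example` lists.

```typescript
// Sketch only: a small .env checker. It handles plain KEY=value lines,
// comments, and simple quoted values, not the full dotenv syntax.

// Parse KEY=value lines from .env text, skipping blank lines and comments.
function parseEnv(text: string): Map<string, string> {
  const vars = new Map<string, string>();
  for (const raw of text.split(/\r?\n/)) {
    const line = raw.trim();
    if (line === "" || line.startsWith("#")) continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key before "=", or no "=" at all
    const key = line.slice(0, eq).trim();
    // Strip one pair of matching surrounding quotes, if present.
    const value = line.slice(eq + 1).trim().replace(/^(["'])(.*)\1$/, "$2");
    vars.set(key, value);
  }
  return vars;
}

// Return the required keys that are absent or set to an empty value.
function missingKeys(text: string, required: string[]): string[] {
  const vars = parseEnv(text);
  return required.filter((key) => (vars.get(key) ?? "") === "");
}
```

For example, `missingKeys(envText, ["GEMINI_API_KEY", "ELEVENLABS_API_KEY"])` returns an empty array when both (hypothetically named) keys are set.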

Run locally

supabase start
supabase functions serve --no-verify-jwt
# In another terminal
python3 -m http.server

Open http://localhost:8000/ in your browser.