Note

This is an experimental project. Output can be unreliable. This is a fun project. If you have any feedback, open an issue. BE RESPONSIBLE with your prompts.

Generative Video

This is a quick POC of generating a video from still images (Generative Images) and audio (text to speech api) with help of Gemini as the language model.

Requirements

  1. Install ffmpeg
  2. Install the python requirements
pip install -r requirements.txt
  1. Gemini API
  2. Google Imagen2 Access
  3. Service Account with aiplatform.endpoints.predict permission
  4. .env file
DS_GOOGLE_API_KEY=your-gemini-api-key
DS_PROJECT_ID=your-project-id
DS_LOCATION=us-central1

How to run

  1. Export the GOOGLE_APPLICATION_CREDENTIALS
  2. Run below command
streamlit run app.py