
Capgen is an AI automatic captions generator for videos.

Primary language: Python. License: MIT.

capgen

Capgen is an automatic caption generator for videos. It uses the Whisper neural network, released by OpenAI, to generate accurate timestamped captions for your videos in the SRT file format. Whisper is introduced at https://openai.com/blog/whisper/. The code for Capgen is a from-scratch re-implementation of the code released by OpenAI. Support for other output formats will be added in the future 👀.
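To make the output format concrete, here is a minimal sketch of how timestamped segments can be rendered as SRT text. It is not taken from Capgen's source: the function names `format_timestamp` and `segments_to_srt`, and the `(start, end, text)` tuple layout, are assumptions chosen for illustration. Only the SRT layout itself (a running index, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` line, the caption text, then a blank line) is standard.

```python
def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render (start, end, text) tuples as the text of an SRT file.

    The tuple layout is a hypothetical stand-in for whatever the
    transcription step actually produces.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive caption blocks.
    return "\n".join(blocks)
```

Calling `segments_to_srt([(0.0, 2.5, "Hello"), (2.5, 4.0, "World")])` yields two numbered caption blocks that most video players can load as subtitles.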

Dependencies

Installing

  1. Download the source code.
  2. Change into the capgen\capgen directory.
  3. Download and install the ffmpeg command-line tool.
  4. Run pip install -r requirements.txt to install dependencies.
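After completing the steps above, it can help to confirm that ffmpeg is reachable from the command line. The following is a small sketch using only the Python standard library. The helper name `find_ffmpeg` is hypothetical and not part of Capgen, and the check assumes the executable is named `ffmpeg` and must be on your PATH.

```python
import shutil


def find_ffmpeg(name: str = "ffmpeg"):
    """Return the full path of the executable, or None if it is not on PATH."""
    return shutil.which(name)


if __name__ == "__main__":
    path = find_ffmpeg()
    if path is None:
        print("ffmpeg was not found on PATH; install it or add it to PATH.")
    else:
        print(f"ffmpeg found at {path}")
```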

Usage

  • Run python capgen.py <path-to-video>. An srt file is generated in the same directory.
  • Run python capgen.py -h to see all the available options.
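To caption several videos in one go, the single-file command above can be wrapped in a short script. This is only a sketch under stated assumptions: it assumes `capgen.py` is in the current working directory, the set of video extensions is an illustrative guess, and `build_commands` and `caption_directory` are hypothetical helpers, not part of Capgen. It passes only the video path, which is the one invocation documented above.

```python
import subprocess
import sys
from pathlib import Path

# Assumption: extensions treated as videos; adjust to your own files.
VIDEO_SUFFIXES = {".mp4", ".mkv", ".mov"}


def build_commands(paths, script="capgen.py"):
    """Build one `python capgen.py <path-to-video>` command per video path."""
    commands = []
    for path in paths:
        if Path(path).suffix.lower() in VIDEO_SUFFIXES:
            commands.append([sys.executable, script, str(path)])
    return commands


def caption_directory(directory):
    """Run the caption command for every video found directly in `directory`."""
    for command in build_commands(sorted(Path(directory).iterdir())):
        # check=True stops at the first video that fails to process.
        subprocess.run(command, check=True)
```

Using `sys.executable` keeps the script on the same Python interpreter (and therefore the same installed dependencies) that launched it.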