A modern GUI application that transcribes and translates audio and video files, offering the option to save the subtitles as separate files, embed the subtitles in a .mkv format, or hardcode them into the video using OpenAI Whisper and ffmpeg.
Tested on Windows with an Nvidia GPU.
- Python version 3.9 or newer
- Torch with Cuda
- ffmpeg
- customtkinter
For more information please visit OpenAI Wishper github: https://github.com/openai/whisper
- Install the required dependencies by running the following command:
pip install customtkinter Pillow torch -f https://download.pytorch.org/whl/torch_stable.html openai-whisper
- Install fffmpeg:
# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg
# on Arch Linux
sudo pacman -S ffmpeg
# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg
# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg
- Run the application by executing the following command:
python main.py