- This project is an automatic speech recognition application that takes a YouTube video link or a video file as input to generate a video with subtitles.
- You can also upload an audio file to generate a transcript as .txt, .vtt, .srt files.
- The application performs 2 tasks:
- Detects the language, transcribes the input video in its original language.
- Detects the language, translates it into English and then transcribes.
- Downloaded the video of the input link using pytube.
- Generated a transcription of the video using the OpenAI Whisper model.
- Saved the transcriptions as .txt, .vtt and .srt files.
- Generated a subtitled version of the input video using ffmpeg.
- Displayed the original video and the subtitled video side by side.
- Built a multipage web app using Streamlit and hosted on HuggingFace Spaces.
- You can download the generated .txt, .vtt, .srt files and the subtitled video.
- You can use the app via this link.
pendave/Auto-Subtitled-Video-Generator
Input a YouTube video link or upload a video file and get a video with subtitles.
PythonMIT