This repository contains Jupyter Notebooks for transcribing YouTube videos, processing audio and video files, and utilizing various models to transcribe speech into text in different formats.
This notebook transcribes YouTube videos in English and Portuguese, using the YouTube API to retrieve video data and Whisper to generate the text.
- Connects to the YouTube API to retrieve video data.
- Downloads video/audio and processes the files.
- Uses the Whisper model to generate transcriptions in both English and Portuguese.
whisper
youtube_dl
torch
ffmpeg
pydub
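The Whisper step can be sketched roughly as follows. This assumes the `openai-whisper` package (imported as `whisper`); the helper names, the `base` model size, and the language table are illustrative assumptions, not taken from the notebook.

```python
# Hypothetical mapping from language names to the codes Whisper accepts.
SUPPORTED_LANGUAGES = {"english": "en", "portuguese": "pt"}


def language_code(name):
    """Return the two-letter code for a supported language name."""
    key = name.strip().lower()
    if key not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unsupported language: {name!r}")
    return SUPPORTED_LANGUAGES[key]


def transcribe(audio_path, language, model_name="base"):
    """Transcribe one audio file and return the recognized text."""
    # Imported here so the helper above works without Whisper installed.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path, language=language_code(language))
    return result["text"].strip()
```

Calling `transcribe("audio.mp3", "Portuguese")` would load the model and return the text for a local file named `audio.mp3` (a placeholder name). Passing the language explicitly skips Whisper's automatic language detection.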
This notebook focuses on downloading and transcribing YouTube videos.
- Downloads YouTube videos or audio files using the youtube_dl library.
- Processes audio with Whisper to generate transcriptions.
whisper
youtube_dl
ffmpeg
tqdm
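The download step might look like the sketch below. It uses `youtube_dl` options that extract the audio track through ffmpeg; the function names, the output directory, and the choice of MP3 are assumptions made for illustration, and the notebook may configure the download differently.

```python
import os


def build_options(output_dir, codec="mp3"):
    """Build youtube_dl options that keep only the audio track."""
    return {
        "format": "bestaudio/best",
        # Name each file after its video id, e.g. downloads/<id>.<ext>.
        "outtmpl": os.path.join(output_dir, "%(id)s.%(ext)s"),
        "postprocessors": [
            {
                "key": "FFmpegExtractAudio",  # requires ffmpeg on PATH
                "preferredcodec": codec,
                "preferredquality": "192",
            }
        ],
    }


def download_audio(url, output_dir="downloads", codec="mp3"):
    """Download the audio of one video and return the expected file path."""
    # Imported here so build_options works without youtube_dl installed.
    import youtube_dl

    os.makedirs(output_dir, exist_ok=True)
    with youtube_dl.YoutubeDL(build_options(output_dir, codec)) as ydl:
        info = ydl.extract_info(url, download=True)
    return os.path.join(output_dir, f"{info['id']}.{codec}")
```

The returned path can then be handed to Whisper for transcription.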
This notebook expands the transcription functionality to support a variety of audio and video formats, beyond YouTube.
- Accepts various media formats, including .mp3, .mp4, .wav, .opus, and more.
- Uses pydub and ffmpeg to convert and process files for transcription.
- Outputs text in multiple formats, including .txt.
whisper
ffmpeg
pydub
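One way to sketch the conversion step with pydub is shown below. The extension set only repeats the formats named above, and the function names, the `converted` directory, and the 16 kHz mono target are assumptions rather than the notebook's actual settings.

```python
from pathlib import Path

# Formats named in this README; the notebook may accept more.
SUPPORTED_EXTENSIONS = {".mp3", ".mp4", ".wav", ".opus"}


def is_supported(path):
    """Check the file extension against the known media formats."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def wav_path_for(path, output_dir):
    """Return where the converted WAV file for `path` would be written."""
    return Path(output_dir) / (Path(path).stem + ".wav")


def convert_to_wav(path, output_dir="converted"):
    """Convert a media file to 16 kHz mono WAV and return the new path."""
    # Imported here so the helpers above work without pydub installed.
    from pydub import AudioSegment

    if not is_supported(path):
        raise ValueError(f"Unsupported media format: {path}")
    target = wav_path_for(path, output_dir)
    target.parent.mkdir(parents=True, exist_ok=True)
    audio = AudioSegment.from_file(path)  # decoding is delegated to ffmpeg
    audio = audio.set_frame_rate(16000).set_channels(1)
    audio.export(str(target), format="wav")
    return target
```

Whisper resamples its input to 16 kHz mono anyway, so converting up front mainly keeps the intermediate files small and uniform.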
- Clone the repository:
  git clone https://github.com/your_username/your_repository.git
  cd your_repository
- Install the required Python packages:
  pip install -r requirements.txt
- Run the notebooks using Jupyter or Google Colab.
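After installation, a quick check along these lines can confirm that everything is available. It is only a sketch: the module list mirrors the libraries named in this README, and the function name is hypothetical. ffmpeg is looked up on the PATH rather than imported, because pydub and Whisper call it as a command-line tool.

```python
import importlib.util
import shutil

# Importable module names for the libraries listed in this README.
PYTHON_MODULES = ["whisper", "torch", "youtube_dl", "pydub", "tqdm"]


def missing_dependencies(modules=PYTHON_MODULES, binaries=("ffmpeg",)):
    """Return the names of modules and executables that cannot be found."""
    missing = [m for m in modules if importlib.util.find_spec(m) is None]
    missing += [b for b in binaries if shutil.which(b) is None]
    return missing


if __name__ == "__main__":
    problems = missing_dependencies()
    print("Missing:", ", ".join(problems) if problems else "nothing")
```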
Dependencies
The following libraries are required to run these notebooks:
whisper
ffmpeg
torch
youtube_dl
pydub
tqdm
How to Use
- Open the notebook in Jupyter or Google Colab.
- Follow the steps provided in each notebook to transcribe videos or audio files.
- Transcriptions will be saved in .txt format in the specified directory.
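Saving a transcription as a .txt file in a chosen directory could be done as in this sketch. The function name and the convention of reusing the media file's base name are assumptions; the notebooks may name their output differently.

```python
from pathlib import Path


def save_transcript(text, media_path, output_dir):
    """Write `text` to <output_dir>/<media file name>.txt and return the path."""
    out_dir = Path(output_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / (Path(media_path).stem + ".txt")
    out_path.write_text(text.strip() + "\n", encoding="utf-8")
    return out_path
```

For example, `save_transcript(text, "clips/talk.mp4", "transcripts")` would write `transcripts/talk.txt`, where both paths are placeholders.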