Youmarizer

Youmarizer is a Python application that extracts and summarizes content from YouTube videos. The application performs the following tasks:

Features

Download YouTube videos and convert them to audio.
Split audio into manageable chunks for transcription.
Transcribe audio using the Vosk speech recognition model.
Summarize the transcribed text using a transformer-based summarization model.
Extract and display key points from the summary.
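The transcription step listed above can be sketched roughly as follows. This is a minimal sketch, not the project's actual code: it assumes each audio chunk is a 16-bit mono PCM WAV file, and the helper names transcribe_wav and extract_text are hypothetical. The Model, KaldiRecognizer, AcceptWaveform, Result, and FinalResult calls come from the Vosk Python package.

```python
import json
import wave


def extract_text(result_jsons):
    """Join the "text" fields of Vosk JSON result strings into one transcript."""
    parts = []
    for raw in result_jsons:
        text = json.loads(raw).get("text", "").strip()
        if text:
            parts.append(text)
    return " ".join(parts)


def transcribe_wav(wav_path, model_dir="vosk-model-small-en-us-0.15"):
    """Transcribe one 16-bit mono PCM WAV file with Vosk (hypothetical helper)."""
    # Imported lazily so extract_text can be used without Vosk installed.
    from vosk import Model, KaldiRecognizer

    model = Model(model_dir)
    results = []
    with wave.open(wav_path, "rb") as wf:
        recognizer = KaldiRecognizer(model, wf.getframerate())
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True once an utterance is complete.
            if recognizer.AcceptWaveform(data):
                results.append(recognizer.Result())
        # FinalResult flushes whatever audio is still buffered.
        results.append(recognizer.FinalResult())
    return extract_text(results)
```

Transcribing chunk by chunk and joining the text afterwards keeps memory use bounded for long videos.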

Requirements

You can install the necessary libraries using pip:

pip install yt-dlp pydub tqdm vosk transformers colorama

Installation

Clone the repository:

git clone https://github.com/yourusername/youmarizer.git

Install ffmpeg: Ensure that ffmpeg is installed on your system. You can download it from FFmpeg’s official website and follow the installation instructions for your operating system.
Download the Vosk model: Download the model from the official Vosk models page and extract it to a directory named vosk-model-small-en-us-0.15 within your project folder.
Install Python dependencies: Make sure all the required libraries are installed by running the pip install command shown above.

Usage

Run main.py, the main script, then enter the YouTube video URL when prompted.

View the results:
- The summary will be displayed, and you will be prompted to press Enter to continue.
- Key points extracted from the summary will be displayed as bullet points.
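As an illustration of the key-point display, here is one simple approach. It assumes the key points are just the first few sentences of the summary; the project's actual extraction logic may differ, and the names extract_key_points and format_key_points are hypothetical.

```python
import re


def extract_key_points(summary, max_points=5):
    """Split a summary into sentences and keep the first few as key points."""
    # Split on whitespace that follows sentence-ending punctuation.
    sentences = re.split(r"(?<=[.!?])\s+", summary.strip())
    return [s for s in sentences if s][:max_points]


def format_key_points(points):
    """Render key points as a bulleted block of text, one point per line."""
    return "\n".join(f"- {point}" for point in points)
```

Calling print(format_key_points(extract_key_points(summary))) after the Enter prompt would produce the bulleted output described above.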

File Structure

main.py: The main script to run the application.
audio_processor.py: Contains functions for downloading, splitting, and processing audio.
transcriber.py: Contains functions for speech recognition and transcription.
summarizer.py: Contains functions for text summarization and key point extraction.
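To give a sense of what the summarization functions might look like, here is a minimal sketch. It assumes the Hugging Face transformers summarization pipeline with its default model (the model the project actually uses is not stated), and it assumes long transcripts are split by word count before summarizing. The names split_into_word_chunks and summarize and the 400-word limit are hypothetical.

```python
def split_into_word_chunks(text, max_words=400):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def summarize(text, max_words=400):
    """Summarize long text chunk by chunk and join the partial summaries."""
    # Imported lazily: transformers is heavy and downloads a model on first use.
    from transformers import pipeline

    # Default summarization model; the project's own choice is an unknown here.
    summarizer = pipeline("summarization")
    partial_summaries = []
    for chunk in split_into_word_chunks(text, max_words):
        # truncation=True guards against chunks that exceed the model's input limit.
        result = summarizer(chunk, truncation=True)
        partial_summaries.append(result[0]["summary_text"])
    return " ".join(partial_summaries)
```

Chunking matters because summarization models accept a limited number of input tokens, and a full video transcript usually exceeds that limit.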

Notes

Ensure you have ffmpeg installed for audio processing. You can download it from FFmpeg’s official website.
Adjust the chunk_length_ms parameter in process_audio if you encounter performance issues or need finer control over chunk sizes.
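To show how chunk_length_ms affects splitting, here is a rough sketch built on pydub. It is not the project's process_audio function: the names chunk_bounds and split_audio and the 60-second default are assumptions made for illustration.

```python
def chunk_bounds(total_ms, chunk_length_ms=60_000):
    """Return (start_ms, end_ms) pairs covering total_ms in fixed-size chunks."""
    if chunk_length_ms <= 0:
        raise ValueError("chunk_length_ms must be positive")
    return [
        (start, min(start + chunk_length_ms, total_ms))
        for start in range(0, total_ms, chunk_length_ms)
    ]


def split_audio(path, chunk_length_ms=60_000):
    """Split an audio file into pydub segments (hypothetical helper)."""
    # Imported lazily; pydub relies on ffmpeg to decode most formats.
    from pydub import AudioSegment

    audio = AudioSegment.from_file(path)
    # len(audio) is the duration in milliseconds, and slicing uses milliseconds too.
    return [audio[start:end] for start, end in chunk_bounds(len(audio), chunk_length_ms)]
```

Smaller values produce more, shorter chunks, which lowers peak memory use per transcription call at the cost of more chunk boundaries that can cut through a word.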

Troubleshooting

Error: ffmpeg not found: Make sure ffmpeg is installed and added to your system’s PATH.
Model not found: Verify that the Vosk model is correctly downloaded and extracted in the specified directory.
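Both problems can be checked before running the application. The following is a small sketch using only the standard library; the function name check_environment is hypothetical, and the default directory name is the one given in the installation steps.

```python
import os
import shutil


def check_environment(model_dir="vosk-model-small-en-us-0.15"):
    """Return a list of human-readable problems; empty if everything looks fine."""
    problems = []
    # shutil.which searches the directories on PATH for an executable.
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    if not os.path.isdir(model_dir):
        problems.append(f"Vosk model directory not found: {model_dir}")
    return problems
```

Printing each entry of check_environment() at startup gives a clearer message than a failure deep inside the audio or transcription code.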

License

This project is licensed under the MIT License. See the LICENSE file for details.