METROEXODUS007's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
openai/openai-cookbook
Examples and guides for using the OpenAI API
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
chidiwilliams/buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
SubtitleEdit/subtitleedit
the subtitle editor :)
Const-me/Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Aegisub/Aegisub
Cross-platform advanced subtitle editor
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Helsinki-NLP/Tatoeba-Challenge
SpeechColab/GigaSpeech
Large, modern dataset for speech recognition
Helsinki-NLP/Opus-MT
Open neural machine translation models and web services
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
benob/recasepunc
Model for recasing and repunctuating ASR transcripts