METROEXODUS007

METROEXODUS007's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python69k 574 08.1k
openai/openai-cookbook
Examples and guides for using the OpenAI API
Language:MDX58.9k 867 4619.4k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47k 305 6625.6k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C34.9k 312 1.3k3.6k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.7k 204 3782.1k
chidiwilliams/buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Language:Python12.2k 85 452916
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
Language:Python11.3k 161 1.1k1.2k
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
Language:Python8.8k 133 5791k
SubtitleEdit/subtitleedit
the subtitle editor :)
Language:C#8.4k 160 4.7k893
Const-me/Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Language:C++8.2k 86 221704
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook7.9k 118 1.5k1.1k
h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Language:Jupyter Notebook6.9k 388 9.4k2k
Aegisub/Aegisub
Cross-platform advanced subtitle editor
Language:C++3.1k 98 255331
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook2.8k 26 157271
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python2k 49 126319
Helsinki-NLP/Tatoeba-Challenge
Language:Makefile799 22 3690
SpeechColab/GigaSpeech
Large, modern dataset for speech recognition
Language:Shell634 18 6162
Helsinki-NLP/Opus-MT
Open neural machine translation models and web services
Language:Python605 16 8471
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
Language:Python429 17 2360
benob/recasepunc
Model for recasing and repunctuating ASR transcripts
Language:Python126 4 1720