lbehringer

lbehringer's Stars

jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.7k1.2k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.2k6.4k
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language:Python22.3k5.5k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python34.8k4.1k
152334H/tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
Language:Jupyter Notebook777179
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python54k5.6k
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python3.4k306
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python32.2k2.4k
HazyResearch/safari
Convolutions for Sequence Modeling
Language:Assembly86271
b04901014/MQTTS
Language:Python24935
alvinlindstam/grapheme
A python package for grapheme aware string handling
Language:Python1047
JonathanFly/bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
Language:Jupyter Notebook98793
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.3k4.2k
elevenlabs/elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
Language:Python2.1k239
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python11.5k2.4k
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language:Python2.7k422
AndreevP/wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
Language:Python13513
common-voice/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Language:TypeScript3.3k840
miguelmota/intent-utterance-expander
Expand custom utterance slots of phrases, to use with Alexa Skills Kit Sample Utterances.
Language:JavaScript308
snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Language:Jupyter Notebook4.9k303
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Language:Python97977
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook92.5k14.8k
as-ideas/ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
Language:Python577113
iisys-hof/HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
Language:Python26
dmort27/panphon
Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.
Language:Python21346
openai/openai-cookbook
Examples and guides for using the OpenAI API
Language:MDX58.6k9.3k
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language:Python84497
py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Language:Python8.1k1.4k
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Language:Python60593
cldf-clts/clts
Cross-Linguistic Transcription Systems
Language:JavaScript143