Georgehappy1

Georgehappy1's Stars

mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language: Jupyter Notebook 37.7k 396 674k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 33.5k 204 1.2k3.8k
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
Language: Python 7.6k 81 152756
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language: Jupyter Notebook 7.5k 89 128739
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language: Python 4.3k 55 97432
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language: Python 3.8k 78 125652
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Language: Python 1.3k 23 1773
PolyAI-LDN/conversational-datasets
Large datasets for conversational AI
Language: Python 1.3k 74 30167
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language: Python 1.2k 45 4383
Vaibhavs10/open-tts-tracker
1.1k 64 1569
k2-fsa/icefall
Language: Python 902 48 649287
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
618 32 227
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Language: Python 433 15 1439
KdaiP/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
Language: Python 347 23 2039
metame-ai/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
313 30 211
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Language: Python 313 17 2845
huggingface/dataspeech
Language: Python 280 13 1537
dubverse-ai/MahaTTS
Language: Python 249 13 1617
PolyAI-LDN/pheme
Language: Python 246 11 1923
fishaudio/audio-preprocess
Preprocess Audio for training
Language: Python 232 8 945
CODEJIN/NaturalSpeech2
Language: Jupyter Notebook 139 13 1215
neonbjb/DL-Art-School
DLAS - A configuration-driven trainer for generative models
Language: Python 136 5 20136
0nutation/USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
Language: Python 128 8 411
AudiogenAI/agc
Audiogen Codec
Language: Python 118 3 111
uniaudio666/UniAudio
The official source code of UniAudio
Language: Python 81 8 26
NVIDIA/RAD-MMM
A TTS model that makes a speaker speak new languages
Language: Roff 73 5 16
ex3ndr/supervoice-gpt
GPT-style network for phonemization with durations of text
Language: Jupyter Notebook 61 6 49
scutcsq/Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
Language: Python 58 8 24
innnky/descript-audio-vae
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
Language: Python 43 8 15
jaehyeongAN/RedisAI-demo
Language: Python 4 2 01
