Moonmore

Moonmore's Stars

Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
19.7k 382 271.6k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.9k 98 181.1k
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python8.5k 82 4481.1k
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.8k 54 124497
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.8k 41 160343
libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
Language:C3k 34 16122
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language:Python2.5k 42 110226
Vaibhavs10/open-tts-tracker
1.1k 65 1669
lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Language:Python928 15 158124
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language:Python925 22 5754
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language:Python847 32 5797
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
796 44 348
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Language:Python757 17 2838
google/visqol
Perceptual Quality Estimator for speech and audio
Language:C++719 27 72127
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python694 17 5452
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Language:Python663 15 3657
AI-Hobbyist/Genshin_Datasets
Genshin Datasets For SVC/SVS/TTS
611 9 1637
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python599 31 4080
facebookresearch/textlesslib
Library for Textless Spoken Language Processing
Language:Python530 16 2451
lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Language:Python397 26 2437
huggingface/dataspeech
Language:Python320 13 1651
b04901014/MQTTS
Language:Python251 12 1136
innnky/ar-vits
text to speech using autoregressive transformer and VITS
Language:Python234 15 615
TongTong313/rectified-flow
从零手搓Flow Matching（Rectified Flow）
Language:Python217 5 011
maum-ai/phaseaug
ICASSP 2023 Accepted
Language:Python190 5 1214
scutcsq/Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
Language:Python58 8 24
asappresearch/simple-tts
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
Language:Python50 4 22
nii-yamagishilab/PartialSpoof
Language:Jupyter Notebook41 3 85
omine-me/LaughterSegmentation
Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspeech 2024
Language:Python34 3 42
YangAi520/APCodec
Language:Python21 1 21

Moonmore

Moonmore's Stars

Hannibal046/Awesome-LLM

naklecha/llama3-from-scratch

SWivid/F5-TTS

huggingface/parler-tts

FunAudioLLM/SenseVoice

libAudioFlux/audioFlux

haoheliu/AudioLDM

Vaibhavs10/open-tts-tracker

lenML/Speech-AI-Forge

jishengpeng/WavTokenizer

gemelo-ai/vocos

ga642381/speech-trident

sihyun-yu/REPA

google/visqol

ddlBoJack/emotion2vec

csteinmetz1/pyloudnorm

AI-Hobbyist/Genshin_Datasets

yangdongchao/AcademiCodec

facebookresearch/textlesslib

lucidrains/e2-tts-pytorch

huggingface/dataspeech

b04901014/MQTTS

innnky/ar-vits

TongTong313/rectified-flow

maum-ai/phaseaug

scutcsq/Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction

asappresearch/simple-tts

nii-yamagishilab/PartialSpoof

omine-me/LaughterSegmentation

YangAi520/APCodec