chl17

PhD Student @ EPFL, Speech Processing

Idiap Research Institute, Martigny, Switzerland

chl17's Stars

suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook
36.3k 329 4454.3k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python
32.6k 187 5633.5k
MetaCubeX/mihomo
A simple Python Pydantic model for Honkai: Star Rail parsed data from the Mihomo API.
Language: Python
16.9k 100 1.2k2.7k
fishaudio/fish-speech
Brand new TTS solution
Language: Python
14.7k 99 4121.1k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language: Jupyter Notebook
13.3k 174 5201.8k
ShiArthur03/ShiArthur03
Language: MATLAB
10.4k 32 1.4k1.9k
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language: Python
10k 135 51863
VirgilClyne/iRingo
Unlock the full set of Apple features and integrated services
9.6k 87 0360
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Jupyter Notebook
7.8k 76 216589
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language: Jupyter Notebook
7.7k 87 130749
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language: Jupyter Notebook
7.1k 59 138482
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python
5k 77 198422
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Language: Jupyter Notebook
3.2k 49 80429
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python
3k 87 98419
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python
2.5k 62 174266
GMvandeVen/continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
Language: Jupyter Notebook
1.6k 28 30312
perixtar/2024-Tech-OA
List of Tech Company OAs. Save your time from finding them all over the internet.
1.5k 138 12100
microsoft/NeuralSpeech
Language: Python
1.4k 33 125183
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python
1.3k 53 31101
MikeWang000000/PD-Runner-Revived
PD-Runner (Parallels Desktop) archived re-upload
Language: Swift
1.2k 46 35291
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
Language: Jupyter Notebook
288 10 1225
tarepan/SpeechMOS
Easy-to-Use Speech MOS predictors
Language: Python
232 7 1516
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Language: Python
217 14 4334
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
Language: HTML
210 21 2658
tfjgeorge/nngeometry
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
Language: Python
207 7 3320
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
Language: Python
147 17 12946
Thrandis/EKFAC-pytorch
Repository containing PyTorch code for EKFAC and K-FAC preconditioners.
Language: Python
140 7 513
gmltmd789/UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
Language: Jupyter Notebook
133 11 913
HSU-ANT/beaqlejs
BeaqleJS provides a framework for creating browser-based listening tests and is built purely on open web standards such as HTML5 and JavaScript.
Language: JavaScript
86 18 1449
QxLabIreland/listening-test
An open-source platform for browser-based subjective quality tests of speech and audio.
Language: TypeScript
32 4 66