chl17's Stars
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
2noise/ChatTTS
A generative speech model for daily dialogue.
MetaCubeX/mihomo
A simple Python Pydantic model for Honkai: Star Rail parsed data from the Mihomo API.
fishaudio/fish-speech
Brand new TTS solution
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
ShiArthur03/ShiArthur03
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
VirgilClyne/iRingo
Unlock full Apple features and integrated services
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
GMvandeVen/continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
perixtar/2024-Tech-OA
List of Tech Company OAs. Save your time from finding them all over the internet.
microsoft/NeuralSpeech
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
MikeWang000000/PD-Runner-Revived
PD-Runner (Parallels Desktop) archived re-upload
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
tarepan/SpeechMOS
Easy-to-Use Speech MOS predictors
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
tfjgeorge/nngeometry
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
Thrandis/EKFAC-pytorch
Repository containing Pytorch code for EKFAC and K-FAC preconditioners.
gmltmd789/UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
HSU-ANT/beaqlejs
BeaqleJS provides a framework to create browser-based listening tests and is purely based on open web standards like HTML5 and JavaScript.
QxLabIreland/listening-test
An open source platform for browser based speech and audio subjective quality tests.