aaaqeczyh's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python 35.3k 193 6213.8k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language: Python 11.6k 69 114730
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python 7.8k 67 161668
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language: Python 5.3k 56 265523
andrewyng/translation-agent
Language: Python 5.3k 59 20637
megvii-research/NAFNet
The state-of-the-art image restoration model without nonlinear activation functions.
Language: Python 2.4k 23 156308
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python 2.1k 31 165527
swz30/Restormer
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
Language: Python 2k 17 106251
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language: Python 1.6k 29 96157
haoheliu/voicefixer
General Speech Restoration
Language: Python 1.1k 17 59134
TencentGameMate/chinese_speech_pretrain
Chinese speech pretrained models
Language: Shell 1.1k 10 5790
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Language: Python 995 58 46217
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language: Python 979 68 0124
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language: Python 898 31 57107
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language: Python 813 21 48116
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Language: Python 703 16 54101
pltrdy/rouge
A full Python Implementation of the ROUGE Metric (not a wrapper)
Language: Python 687 8 49102
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Language: Python 658 4 84119
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language: Python 657 19 89114
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
Language: Python 644 29 60115
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Language: Python 582 11 6681
haoheliu/voicefixer_main
General Speech Restoration
Language: Python 276 11 1956
google-research-datasets/cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
194 14 214
AndreevP/wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
Language: Python 155 5 522
YouTaoBaBa/Chinese-Dialogue-Dataset
A collection of the currently available open-source Chinese dialogue datasets
142 2 112
epfl-dlab/llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
Language: Jupyter Notebook 71 3 316
fpaissan/tinyCLAP
Implementation of tinyCLAP.
Language: Python 25 3 12
sunzewei2715/Doc2Doc_NMT
The repository for the paper: Rethinking Document-level Neural Machine Translation
Language: Python 25 5 115
google/df-conformer
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
Language: HTML 20 2 14
PKU-ONELab/Themis
The official repository for our EMNLP 2024 paper, Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability.
Language: Python 19 0 11