yuan1615's Stars
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
TencentGameMate/chinese_speech_pretrain
Chinese speech pretrained models
ddlBoJack/Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
LinuxSuRen/remote-jobs-in-china
** companies that support remote work
wenet-e2e/wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
HLTSingapore/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
thuhcsi/FlatTN
Chinese Text Normalization and Dataset
Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
yeyupiaoling/PPASR
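End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to hands-on practice, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Conformer, and Squeezeformer models.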
tts-tutorial/icassp2022
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
unilight/LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
dengxiuqi/ChineseLyrics
A database of 100,000 Chinese song lyrics
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)
coneypo/Dlib_face_recognition_from_camera
Detect and recognize faces from a camera; supports recognizing multiple faces simultaneously
xiangyuecn/Recorder
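HTML5 JavaScript audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats; supports PC, some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat; provides ASR speech-to-text, an H5 voice call and chat example, and DTMF encoding and decoding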
cnlinxi/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
microsoft/NeuralSpeech
lochenchou/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
MoonInTheRiver/NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
LEEYOONHYUNG/BVAE-TTS
Official implementation of BVAE-TTS
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
liusongxiang/ppg-vc
PPG-Based Voice Conversion
bilibili/ailab