heixiaoniu's Stars
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
kaen2891/adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"
canoalberto/imbalanced-streams
A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework
NewComer00/chinese-pdf-ocr
🔎📖对中文PDF进行OCR | OCR for Chinese PDF file using API from DayBreak-u/chineseocr_lite
xiaoman-zhang/KAD
CVxTz/COLA_pytorch
COLA contrastive pre-training method implemented in PyTorch
leslievan/semi-utils
一个批量添加相机机型和拍摄参数的工具,后续「可能」添加其他功能。
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
evelyn0414/OPERA
This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models
LSimon95/megatts2
Unoffical implementation of Megatts2
BITNP/BIThesis
📖 北京理工大学非官方 LaTeX 模板集合,包含本科、研究生毕业设计模板及更多。🎉 (更多文档请访问 wiki 和 release 中的手册)
CODEJIN/Glow_TTS
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
0nutation/USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
tuian/Books-2
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
ConsistencyVC/ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
KunZhou9646/Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
psyai-net/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
JSALT-2022-SSL/superb-prosody
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
haoheliu/voicefixer
General Speech Restoration
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
keonlee9420/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
adelacvg/NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
CODEJIN/NaturalSpeech2
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html