heixiaoniu

heixiaoniu's Stars

lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language:Python778113
kaen2891/adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"
Language:Python171
canoalberto/imbalanced-streams
A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework
Language:Java238
NewComer00/chinese-pdf-ocr
🔎📖对中文PDF进行OCR | OCR for Chinese PDF file using API from DayBreak-u/chineseocr_lite
Language:JavaScript8615
xiaoman-zhang/KAD
Language:Python12310
CVxTz/COLA_pytorch
COLA contrastive pre-training method implemented in PyTorch
Language:Python424
leslievan/semi-utils
一个批量添加相机机型和拍摄参数的工具，后续「可能」添加其他功能。
Language:Python1.3k130
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Language:Python43674
evelyn0414/OPERA
This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models
Language:Python378
LSimon95/megatts2
Unoffical implementation of Megatts2
Language:Python26936
BITNP/BIThesis
📖 北京理工大学非官方 LaTeX 模板集合，包含本科、研究生毕业设计模板及更多。🎉 （更多文档请访问 wiki 和 release 中的手册）
Language:TeX71897
CODEJIN/Glow_TTS
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
Language:Python5312
0nutation/USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
Language:Python13911
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
Language:Python45967
tuian/Books-2
21882
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.9k1.3k
ConsistencyVC/ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Language:Python13422
KunZhou9646/Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
Language:Python8211
psyai-net/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
Language:Python35235
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language:Python1.2k136
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python7.5k634
JSALT-2022-SSL/superb-prosody
Language:Python313
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python30k3k
haoheliu/voicefixer
General Speech Restoration
Language:Python1.1k132
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python66050
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language:Python739123
keonlee9420/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Language:Python28947
adelacvg/NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
Language:Python23212
CODEJIN/NaturalSpeech2
Language:Jupyter Notebook14015
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python2.1k320