vietvq1511's Stars
smacke/ffsubsync
Automagically synchronize subtitles with video.
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
GeostatsGuy/Resources
Inventory of all the educational content that I share on spatial data analytics, geostatistics and machine learning. I hope these resources are helpful, Prof. Michael Pyrcz
georgian-io/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
makcedward/nlpaug
Data augmentation for NLP
bradyneal/causal-book-code
ducanhdt/Movie_Controller
Chrome Extention to control youtube video by hand gesture
dhimasryan/MOSA-Net-Cross-Domain
andabi/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
rishikksh20/Bidirectional-LEM-pytorch
Pytorch Implementation of Bidirectional Long Expressive Memory
sony/sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
kenders2000/distortionDetection
C++ Program to detect Clipping and other overload based nonlinear distortions in Wav Files
KunZhou9646/Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
fuzhenxin/Style-Transfer-in-Text
Paper List for Style Transfer in Text
magenta/ddsp
DDSP: Differentiable Digital Signal Processing
sarulab-speech/UTMOS22
UT-Sarulab MOS prediction system using SSL models
satwikkansal/wtfpython
What the f*ck Python? š±
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
google/visqol
Perceptual Quality Estimator for speech and audio
SamuelBroughton/Mel-Cepstral-Distortion
Calculation of MCD (dB) between two speech waveforms
Jackson-Kang/Prosody-augmentation-for-Text-to-speech
Simple tool for speech dataset augmentation for modeling various prosodies.
dunky11/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
IMLHF/Speech-Enhancement-Measures
speech enhancement metricsļ¼CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Wendison/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
yl4579/StyleTTS
Official Implementation of StyleTTS
uwdata/visualization-curriculum
A data visualization curriculum of interactive notebooks.