vietvq1511's Stars

smacke/ffsubsync
Automagically synchronize subtitles with video.
Language: Python · 6.8k · 281
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.5k · 212
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language: Python · 1.2k · 163
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
Language: Jupyter Notebook · 3.7k · 395
kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Language: HTML · 12.3k · 2.9k
GeostatsGuy/Resources
Inventory of all the educational content that I share on spatial data analytics, geostatistics and machine learning. I hope these resources are helpful, Prof. Michael Pyrcz
37748
georgian-io/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
Language: Python · 59185
makcedward/nlpaug
Data augmentation for NLP
Language: Jupyter Notebook · 4.5k · 463
bradyneal/causal-book-code
Language: Python · 7619
ducanhdt/Movie_Controller
Chrome extension to control YouTube videos with hand gestures
Language: CSS · 1
dhimasryan/MOSA-Net-Cross-Domain
Language: Python · 4810
andabi/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Language: Python · 3.9k · 844
rishikksh20/Bidirectional-LEM-pytorch
Pytorch Implementation of Bidirectional Long Expressive Memory
Language: Python · 91
sony/sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
Language: Python · 18121
kenders2000/distortionDetection
C++ Program to detect Clipping and other overload based nonlinear distortions in Wav Files
Language: C · 305
KunZhou9646/Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
Language: Python · 8211
fuzhenxin/Style-Transfer-in-Text
Paper List for Style Transfer in Text
1.6k · 194
magenta/ddsp
DDSP: Differentiable Digital Signal Processing
Language: Python · 2.9k · 339
sarulab-speech/UTMOS22
UT-Sarulab MOS prediction system using SSL models
Language: Python · 19014
satwikkansal/wtfpython
What the f*ck Python? 😱
Language: Python · 35.8k · 2.7k
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Language: Python · 694117
google/visqol
Perceptual Quality Estimator for speech and audio
Language: C++ · 704125
SamuelBroughton/Mel-Cepstral-Distortion
Calculation of MCD (dB) between two speech waveforms
Language: Jupyter Notebook · 5714
Jackson-Kang/Prosody-augmentation-for-Text-to-speech
Simple tool for speech dataset augmentation for modeling various prosodies.
Language: Python · 14
dunky11/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
Language: Python · 22332
IMLHF/Speech-Enhancement-Measures
Speech enhancement metrics: CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS
Language: MATLAB · 5922
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python · 11.2k · 1.9k
Wendison/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Language: Jupyter Notebook · 34055
yl4579/StyleTTS
Official Implementation of StyleTTS
Language: Python · 40264
uwdata/visualization-curriculum
A data visualization curriculum of interactive notebooks.
Language: Jupyter Notebook · 1.3k · 261