Pinned Repositories
3D-Machine-Learning
A resource repository for 3D machine learning
acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
Awesome-pytorch-list
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
Bert-VITS2
vits2 backbone with multilingual-bert
bookshelf
:books: books
youtube-audio-and-transcript-extract
split audio_segmentation with corresponding transcript for youtube datasets
MuruganR96's Repositories
MuruganR96/Bert-VITS2
vits2 backbone with multilingual-bert
MuruganR96/ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
MuruganR96/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
MuruganR96/Diffusion-SVC
MuruganR96/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
MuruganR96/fish-diffusion
An easy to understand TTS / SVS / SVC framework
MuruganR96/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
MuruganR96/glow-svc
singing voice conversion based on glow-tts
MuruganR96/Grad-SVC
Singing Voice Conversion based on Grad-TTS. The core algorithm is diffusion.
MuruganR96/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
MuruganR96/hifigan-yingram-vc
vc
MuruganR96/knn-vc
Voice Conversion With Just Nearest Neighbors
MuruganR96/lora-svc
singing voice change based on whisper, and lora for singing voice clone
MuruganR96/moshi
MuruganR96/MyArxiv
MuruganR96/PitchVC
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
MuruganR96/pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
MuruganR96/PPG-GradVC
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
MuruganR96/QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
MuruganR96/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
MuruganR96/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
MuruganR96/so-vits-svc
基于vits与softvc的歌声音色转换模型
MuruganR96/so-vits-svc-1
SoftVC VITS Singing Voice Conversion
MuruganR96/so-vits-svc-2
SoftVC VITS Singing Voice Conversion
MuruganR96/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
MuruganR96/StyleTTS-VC
Official Implementation of StyleTTS-VC
MuruganR96/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
MuruganR96/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
MuruganR96/voice-changer
MuruganR96/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild