Pinned Repositories
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
bark
🔊 Text-prompted Generative Audio Model
fish-diffusion
An easy to understand TTS / SVS / SVC framework
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Stable-Diffusion
Best Stable Diffusion and AI Tutorials, Guides, News, Tips and Tricks
stable-diffusion-webui
Stable Diffusion web UI
STIT
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
wav2lip-hq-updated-ESRGAN
Updated fork of wav2lip-hq allowing for the use of current ESRGAN models
Mrc2023's Repositories
Mrc2023/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Mrc2023/bark
🔊 Text-prompted Generative Audio Model
Mrc2023/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Mrc2023/fish-diffusion
An easy to understand TTS / SVS / SVC framework
Mrc2023/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Mrc2023/Stable-Diffusion
Best Stable Diffusion and AI Tutorials, Guides, News, Tips and Tricks
Mrc2023/stable-diffusion-webui
Stable Diffusion web UI
Mrc2023/STIT
Mrc2023/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Mrc2023/tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
Mrc2023/wav2lip-hq-updated-ESRGAN
Updated fork of wav2lip-hq allowing for the use of current ESRGAN models
Mrc2023/whisper
Robust Speech Recognition via Large-Scale Weak Supervision