sunnnnnnnny

Pinned Repositories

AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python0 0 00
AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
Language:Python0 0 00
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0 00
attention_onnx_exp
0 1 00
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python0 0 00
Automatic-Prosody-Annotation
Language:Python0 0 00
AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
Language:Python0 0 00
Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list covering cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related work (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, and SSL-based ASR).
00
open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Language:Python1 1 00
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python1 0 00

sunnnnnnnny's Repositories

sunnnnnnnny/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python1 0 00
sunnnnnnnny/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0 00
sunnnnnnnny/attention_onnx_exp
0 1 00
sunnnnnnnny/AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
Language:Python0 0 00
sunnnnnnnny/BBDown
Bilibili Downloader. A command-line downloader for Bilibili.
Language:C#0 0 00
sunnnnnnnny/fs2_mfa_phone
Language:Python0 1 00
sunnnnnnnny/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python0 0 00
sunnnnnnnny/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Language:Jupyter Notebook0 0
sunnnnnnnny/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Language:Python0 0
sunnnnnnnny/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python0 0
sunnnnnnnny/fish-speech
Brand new TTS solution
Language:Python0 0
sunnnnnnnny/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python0 0
sunnnnnnnny/HierSpeechpp
The official implementation of HierSpeech++
Language:Python0 0
sunnnnnnnny/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook0 0
sunnnnnnnny/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook0 0
sunnnnnnnny/megatts2
Unofficial implementation of Megatts2
Language:Python0 0
sunnnnnnnny/NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
Language:Python0 0
sunnnnnnnny/onnx_for_loop
Language:Python1 0
sunnnnnnnny/OpenVoice
Instant voice cloning by MyShell
Language:Python0 0
sunnnnnnnny/PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
Language:Python0 0
sunnnnnnnny/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook0 0
sunnnnnnnny/StyleTTS
Official Implementation of StyleTTS
Language:Python0 0
sunnnnnnnny/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python0 0
sunnnnnnnny/Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Language:Python0 0
sunnnnnnnny/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook0 0
sunnnnnnnny/Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Language:Python0 0
sunnnnnnnny/usual-problem
Language:Python1 0
sunnnnnnnny/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
sunnnnnnnny/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python
sunnnnnnnny/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Language:Python0 0