MisakaMikoto96

Meow~ | Text-to-speech | USA

the University of Edinburgh, Tokiwadai

MisakaMikoto96's Stars

AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language: Python 141k 1.1k 7.7k 26.7k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 34.7k 286 1.1k 4.2k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 34.2k 205 1.3k 3.9k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language: Python 29.2k 216 2442.9k
bmaltais/kohya_ss
Language: Python 9.5k 93 2k 1.2k
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language: Python 7.6k 82 152757
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
5.6k 213 56427
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python 4.9k 78 193404
Akegarasu/lora-scripts
LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
Language: Python 4.5k 28 483559
rosinality/vq-vae-2-pytorch
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
Language: Python 1.6k 20 77271
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python 1.3k 53 3199
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
1.2k 56 1660
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language: Python 861 70 0 97
152334H/tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
Language: Jupyter Notebook 781 27 125179
lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Language: Python 620 11 1552
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language: Python 595 19 85109
facebookresearch/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language: Python 532 32 2844
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Language: Python 452 15 1540
adefossez/julius
Fast PyTorch based DSP for audio and 1D signals
Language: Python 423 9 1125
v-iashin/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Language: Jupyter Notebook 344 7 3539
rhasspy/gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
Language: Python 277 8 3636
innnky/ar-vits
text to speech using autoregressive transformer and VITS
Language: Python 225 15 515
zyzisyz/mfa_conformer
Language: Python 135 4 1314
X-LANCE/UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Language: Python 119 10 916
ga642381/SpeechPrompt-v2
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
Language: Python 81 6 64
rishikksh20/NaturalSpeech2
Language: Python 70 13 0 3
elevenlabs/elevenlabs-docs
Documentation for elevenlabs.io/docs
Language: MDX 60 17 174262
PlayVoice/BigVGAN
BigVGAN with Neural Source-Filter
Language: Python 50 3 57
Pranjalya/tts-tortoise-gradio
A Gradio setup for Tortoise TTS.
Language: Python 43 1 510
canberk17/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python 1 0 0 0
