MisakaMikoto96

Meow~ | Text-to-speech | USA

the University of Edinburgh常盘台

MisakaMikoto96's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python168k 1.6k 2.7k44.3k
PlexPt/awesome-chatgpt-prompts-zh
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
52.5k 342 9513.5k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.7k 329 4394.2k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python25.7k 177 1304.8k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.8k 202 3812.1k
EmbraceAGI/awesome-chatgpt-zh
ChatGPT 中文指南🔥，ChatGPT 中文调教指南，指令指南，应用开发指南，精选资源清单，更好的使用 chatGPT 让你的生产力 up up up! 🚀
Language:Python10.6k 110 13887
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10k 134 50862
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Language:Python4.1k 40 351416
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language:Jupyter Notebook3.9k 77 109209
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
Language:Jupyter Notebook2.9k 32 183359
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language:Python2.3k 23 329179
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Language:Python1.7k 15 80194
microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language:Python1.2k 24 86112
yzhuoning/Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
1.1k 19 1255
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
Language:Shell1k 10 5683
cjyaddone/ChatWaifu
Combined ChatGPT with Moegoe TTS to create a Chatting Waifu
Language:Python812 11 2189
yeyupiaoling/PPASR
基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
Language:Python810 11 182129
gitmylo/bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
Language:Python654 18 43109
clue-ai/PromptCLUE
PromptCLUE, 全中文任务支持零样本学习模型
Language:Jupyter Notebook650 9 1968
facebookresearch/WavAugment
A library for speech data augmentation in time-domain
Language:Python637 25 1757
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python578 31 4080
auspicious3000/contentvec
speech self-supervised representations
Language:Python462 11 3036
sophiefy/Sovits
An unofficial implementation of the combination of Soft-VC and VITS
Language:Jupyter Notebook460 6 951
descriptinc/lyrebird-wav2clip
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
Language:Python324 11 1328
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Language:Python316 17 2845
hche11/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
Language:Python287 6 1832
microsoft/Pengi
An Audio Language model for Audio Tasks
Language:Python284 14 1315
aoifemcdonagh/audioset-processing
Toolkit for downloading and processing Google's AudioSet dataset.
Language:Jupyter Notebook159 3 641
Moon0316/T2A
Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023
Language:Python82 5 811
gitmylo/bark-data-gen
Create training data for training a voice cloner for bark text to speech.
Language:Jupyter Notebook44 3 410

MisakaMikoto96

MisakaMikoto96's Stars

Significant-Gravitas/AutoGPT

PlexPt/awesome-chatgpt-prompts-zh

suno-ai/bark

svc-develop-team/so-vits-svc

facebookresearch/audiocraft

EmbraceAGI/awesome-chatgpt-zh

AIGC-Audio/AudioGPT

THUDM/VisualGLM-6B

collabora/WhisperSpeech

yuanzhoulvpi2017/zero_nlp

webdataset/webdataset

KaiyangZhou/CoOp

microsoft/SpeechT5

yzhuoning/Awesome-CLIP

TencentGameMate/chinese_speech_pretrain

cjyaddone/ChatWaifu

yeyupiaoling/PPASR

gitmylo/bark-voice-cloning-HuBERT-quantizer

clue-ai/PromptCLUE

facebookresearch/WavAugment

yangdongchao/AcademiCodec

auspicious3000/contentvec

sophiefy/Sovits

descriptinc/lyrebird-wav2clip

Rongjiehuang/GenerSpeech

hche11/VGGSound

microsoft/Pengi

aoifemcdonagh/audioset-processing

Moon0316/T2A

gitmylo/bark-data-gen