MisakaMikoto96

Meow~ | Text-to-speech | USA

the University of Edinburgh, Tokiwadai

MisakaMikoto96's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python 69.8k 575 08.2k
d2l-ai/d2l-zh
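Dive into Deep Learning (动手学深度学习): aimed at Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
Language: Python 62.7k 1.1k 011k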
mli/paper-reading
Paragraph-by-paragraph close readings of classic and new deep learning papers
26.7k 727 02.4k
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language: Python 11.1k 121 2101.1k
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python 4.5k 50 290471
google/lyra
A Very Low-Bitrate Codec for Speech Compression
Language: C++ 3.8k 113 126355
lucidrains/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
Language: Python 3.1k 98 53254
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python 2.9k 87 97417
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language: Python 2.4k 42 107222
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python 2.4k 62 170256
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Language: Python 2k 49 126319
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language: Python 1.9k 39 43167
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language: Python 1.4k 28 89133
facebookresearch/simsiam
PyTorch implementation of SimSiam https://arxiv.org/abs/2011.10566
Language: Python 1.2k 12 46176
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language: Jupyter Notebook 1.1k 18 135211
k2-fsa/icefall
Language: Python 910 48 650291
jia-zhuang/pytorch-multi-gpu-training
A summary of methods and principles for single-machine multi-GPU training in PyTorch
Language: Python 760 5 884
AndreyGuzhov/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
Language: Python 756 17 092
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
Language: Python 624 21 5853
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language: Jupyter Notebook 558 23 29115
ZhuiyiTechnology/t5-pegasus
Chinese generative pretrained model
Language: Python 555 3 4384
jefflai108/Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
Language: Python 479 4 2198
microsoft/CLAP
Learning audio concepts from natural language supervision
Language: Python 468 14 2136
renmada/t5-pegasus-pytorch
Language: Python 401 3 8061
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Language: Python 347 10 1651
yangdongchao/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
Language: Python 346 17 2734
b04901014/MQTTS
Language: Python 253 12 1135
yangdongchao/InstructTTS
The demo page of InstructTTS
155 13 28
janfreyberg/pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
Language: Python 154 3 414
juanalonso/diffusion-audio
List of diffusion-based models and applications
11 4 01
