MisakaMikoto96's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
d2l-ai/d2l-zh
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
mli/paper-reading
深度学习经典、新论文逐段精读
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
google/lyra
A Very Low-Bitrate Codec for Speech Compression
lucidrains/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
facebookresearch/simsiam
PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
k2-fsa/icefall
jia-zhuang/pytorch-multi-gpu-training
整理 pytorch 单机多 GPU 训练方法与原理
AndreyGuzhov/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
ZhuiyiTechnology/t5-pegasus
中文生成式预训练模型
jefflai108/Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
microsoft/CLAP
Learning audio concepts from natural language supervision
renmada/t5-pegasus-pytorch
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
yangdongchao/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
b04901014/MQTTS
yangdongchao/InstructTTS
The deme page of InstructTTS
janfreyberg/pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
juanalonso/diffusion-audio
Lista de modelos y aplicaciones basadas en diffusion