Pinned Repositories
Algorithm-Lin
高级算法期末作业
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
awesome-voice-conversion
A curated list of awesome voice conversion, projects and communities.
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
UnGANable
shirly-24's Repositories
shirly-24/UnGANable
shirly-24/Algorithm-Lin
高级算法期末作业
shirly-24/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
shirly-24/audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
shirly-24/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
shirly-24/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
shirly-24/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
shirly-24/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
shirly-24/awesome-voice-conversion
A curated list of awesome voice conversion, projects and communities.
shirly-24/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
shirly-24/Conference-Accepted-Paper-List
Some Conferences' accepted paper lists (including AI, ML, Robotic)
shirly-24/CVPR2023-Papers-with-Code
CVPR 2023 论文和开源项目合集
shirly-24/ddsp_pytorch
Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch
shirly-24/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
shirly-24/FA-GAN
shirly-24/FAKEBOB
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
shirly-24/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
shirly-24/iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
shirly-24/LDL
Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022
shirly-24/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
shirly-24/NeuralSpeech
shirly-24/paper-reading
深度学习经典、新论文逐段精读
shirly-24/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
shirly-24/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
shirly-24/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
shirly-24/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis