shirly-24

Pinned Repositories

Algorithm-Lin
Advanced Algorithms final project
Language: Jupyter Notebook
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language: Python
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
Language: Python
AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language: Python
avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Language: Python
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).
awesome-voice-conversion
A curated list of awesome voice conversion projects and communities.
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language: Python
UnGANable
Language: Python

shirly-24's Repositories

shirly-24/UnGANable
Language: Python
shirly-24/Algorithm-Lin
Advanced Algorithms final project
Language: Jupyter Notebook
shirly-24/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language: Python
shirly-24/audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
Language: Python
shirly-24/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language: Python
shirly-24/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Language: Python
shirly-24/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
shirly-24/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).
shirly-24/awesome-voice-conversion
A curated list of awesome voice conversion projects and communities.
shirly-24/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language: Python
shirly-24/Conference-Accepted-Paper-List
Accepted paper lists for some conferences (including AI, ML, and robotics)
shirly-24/CVPR2023-Papers-with-Code
A collection of CVPR 2023 papers and open-source projects
shirly-24/ddsp_pytorch
Implementation of Differentiable Digital Signal Processing (DDSP) in PyTorch
Language: C
shirly-24/deep-learning-for-image-processing
Deep learning for image processing, including classification, object detection, etc.
shirly-24/FA-GAN
Language: HTML
shirly-24/FAKEBOB
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Language: Python
shirly-24/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python
shirly-24/iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
Language: Python
shirly-24/LDL
Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022
Language: Python
shirly-24/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Language: Python
shirly-24/NeuralSpeech
Language: Python
shirly-24/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
shirly-24/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Language: Jupyter Notebook
shirly-24/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language: Python
shirly-24/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
Language: Python
shirly-24/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language: Python