Pinned Repositories
AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models, a dark horse in the field of Generative Models
Bert-VITS2
VITS2 backbone with BERT
bwe_historical_recordings-fork-
Bandwidth Extension of Historical Recordings using Generative Adversarial Networks (BEHM-GAN)
golf
A DDSP-based neural vocoder.
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
tacospawn
PyTorch implementation of TacoSpawn, Speaker Generation
unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
VITS2_pytorch_fork_-p0p4
Unofficial VITS2 TTS implementation in PyTorch
WaveletAttention
Wavelet-Attention CNNs for Image Classification
shaun95's Repositories
shaun95/AdaIN-style
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
shaun95/soft-dtw
Python implementation of soft-DTW.
shaun95/tacotron_soobinseo
Tacotron 1 PyTorch implementation
shaun95/AudioNotebooks
Collection of notebooks and scripts related to audio processing and machine learning.
shaun95/audiotsm_A-python-library-for-real-time-audio-time-scale-modification-
A Python library for real-time audio time-scale modification procedures
shaun95/Cuda-Matrix-Multiplication
Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts
shaun95/dctts-pytorch
A PyTorch implementation of DC-TTS
shaun95/dctts2
Deep Convolution Text to Speech
shaun95/FilterBanks_FastPythonImplementation
Filter Banks, Fast Python Implementation
shaun95/General-CUDA-programming
shaun95/homomorphic-filter
Implementation of a homomorphic filter in Python
shaun95/libstft
A tiny C library for short-time Fourier transforms
shaun95/lstm-cuda
A C++ implementation of an LSTM neural network, parallelized for a GPU using CUDA
shaun95/parallel_wavenet_vocoder
shaun95/pggan-pytorch
Progressively Growing GAN in PyTorch for Image and Sound generation
shaun95/PhaseVocoder
Phase Vocoder: modify the speed and pitch of a given audio file.
shaun95/PhaseVocoder-1
A cross-platform command-line utility for time expansion/compression, pitch shifting, and resampling of audio.
shaun95/prog_gans_pytorch_inference
PyTorch inference for "Progressive Growing of GANs" with CelebA snapshot
shaun95/progressive_growing_of_gans
Progressive Growing of GANs for Improved Quality, Stability, and Variation
shaun95/Pytorch-TDNN
shaun95/pytorch_TDNN
Time-delay neural network (TDNN) implemented in PyTorch
shaun95/serge
shaun95/soundtouch
SoundTouch library compiled for ijkplayer/Android http://www.surina.net/soundtouch/sourcecode.html
shaun95/super-resolution
Collection of super-resolution models and algorithms
shaun95/tacotron_kaituoxu
A PyTorch implementation of Tacotron 2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions".
shaun95/transformer-xl
shaun95/TTS-Tacotron-Pytorch
PyTorch implementation of Tacotron, an end-to-end generative TTS model.
shaun95/Voice-Conversion-GAN_VC
Voice Conversion using CycleGANs for Non-Parallel Data
shaun95/WaveGlow
A TensorFlow implementation of WaveGlow
shaun95/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)