Pinned Repositories
acapellabot
Acapella Extraction with a ConvNet
asr-repr-analysis
Audio-Fingerprint-workshop
Audio fingerprint program for lab workshop
audio-visual-speech-enhancement
audio_conditioned_unet
Audio-Conditioned U-Net for Position Estimation in Full Sheet Images
awesome-semantic-segmentation
:metal: awesome-semantic-segmentation
ba-dls-deepspeech
Bayesian-Pitch-Tracking-Using-Harmonic-model
Fast Bayesian pitch tracking using the harmonic model
blow
Code to train and run Blow
vae-npvc
Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
melspectrum007's Repositories
melspectrum007/audio_conditioned_unet
Audio-Conditioned U-Net for Position Estimation in Full Sheet Images
melspectrum007/Bayesian-Pitch-Tracking-Using-Harmonic-model
Fast Bayesian pitch tracking using the harmonic model
melspectrum007/blow
Code to train and run Blow
melspectrum007/cyclevae-vc
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder
melspectrum007/DNP
Audio Denoising with Deep Network Priors
melspectrum007/fastF0Nls
C++ and MATLAB code for fast and accurate fundamental frequency estimation
melspectrum007/Harmony-Transformer
Deep learning model for chord recognition
melspectrum007/hf0
Hybrid f0 estimation using Convolutional Neural Network
melspectrum007/Image_Segmentation
pytorch Implementation of U-Net, R2U-Net, Attention U-Net, Attention R2U-Net.
melspectrum007/kaldi-onnx
Kaldi model converter to ONNX
melspectrum007/LPCNet
Efficient neural speech synthesis
melspectrum007/lws
Fast spectrogram phase recovery using Local Weighted Sums (C/Python/Matlab)
melspectrum007/melgan-neurips
melspectrum007/modeling-plate-spring-reverb
In order to listen to the audio examples, please go the website:
melspectrum007/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
melspectrum007/NMTGMinor
A Neural Machine Translation toolkit for research purpose
melspectrum007/onssen
An open-source speech separation and enhancement library
melspectrum007/OpenSeq2Seq
Toolkit for efficient experimentation with various sequence-to-sequence models
melspectrum007/phonet
Keras-based python framework to compute phonological posterior probabilities from audio files
melspectrum007/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
melspectrum007/pySpeechRev
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.
melspectrum007/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
melspectrum007/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
melspectrum007/spleeter
Deezer source separation library including pretrained models.
melspectrum007/SylNet
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
melspectrum007/transformer-xl
melspectrum007/two_step_mask_learning
A two step optimization for sound source separation on the adaptive front-end domain
melspectrum007/voice-conversion
melspectrum007/Waveglow_Inference_in_CUDA
C++ Code to run waveglow inference in cuda
melspectrum007/ZeroSpeech-TTS-without-T
A Pytorch implementation for the ZeroSpeech 2019 challenge.