enk100

Eliya Nachmani

Tel-Aviv University, Google
Tel Aviv

Pinned Repositories

Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Language:Python0 1 00
Conv-TasNet-1
Language:Python0 1 00
demucs
Code for the paper Music Source Separation in the Waveform Domain
Language:Python0 1 00
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language:Python2 1 00
Noise-Estimation-for-Generative-Diffusion-Models
Language:HTML2 2 00
Non-Gaussian-Denoising-Diffusion-Models
Language:HTML4 2 10
speaker_separation
speaker_separation
Language:HTML14 5 03
Unsupervised_Singing_Voice_Conversion
Language:HTML8 2 13
HyperNetworkDecoder
Hyper Graph Network Decoders for Block Codes
Language:Python21 5 46
svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers, in which we present a new method for separating a mixed audio sequence where multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while keeping the speaker in each output channel fixed. A different model is trained for every possible number of speakers, and the model with the largest number of speakers is used to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
Language:Python1.3k 24 95181

enk100's Repositories

enk100/speaker_separation
speaker_separation
Language:HTML14 5 03
enk100/Unsupervised_Singing_Voice_Conversion
Language:HTML8 2 13
enk100/Non-Gaussian-Denoising-Diffusion-Models
Language:HTML4 2 10
enk100/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language:Python2 1 00
enk100/Noise-Estimation-for-Generative-Diffusion-Models
Language:HTML2 2 00
enk100/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Language:Python0 1 00
enk100/Conv-TasNet-1
Language:Python0 1 00
enk100/demucs
Code for the paper Music Source Separation in the Waveform Domain
Language:Python0 1 00
enk100/Hyper-Graph-Network-Decoders-for-Block-Codes
Hyper-Graph-Network Decoders for Block Codes
2 0
enk100/img2midi
Converts image files to midi songs
Language:Python1 0
enk100/OpenShadingLanguage
Advanced shading language for production GI renderers
Language:C++1 0
enk100/polyglot
Language:HTML1 0
enk100/SimulTron
Language:HTML
