Pinned Repositories
AutoVowelDuration
Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)
DeepAnomaly
Recurrent Neural Networks for Anomaly Detection using Time Series Data
DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
GCommandsPytorch
ConvNets for Audio Recognition using Google Commands Dataset
WatermarkNN
Watermarking Deep Neural Networks (USENIX 2018)
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
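To give a sense of how the library is typically used, here is a minimal, hedged sketch of text-conditioned generation with MusicGen. It follows the public audiocraft documentation as I understand it; the checkpoint name "facebook/musicgen-small" and exact helper names are assumptions that may differ across versions.

```python
# Minimal sketch of text-conditioned music generation with audiocraft.
# Assumes the public MusicGen API (get_pretrained / set_generation_params /
# generate / audio_write); names may differ across audiocraft versions.
import torch
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

model = MusicGen.get_pretrained("facebook/musicgen-small")  # assumed checkpoint name
model.set_generation_params(duration=8)  # generate 8 seconds of audio

descriptions = ["lo-fi hip hop beat with warm piano", "upbeat folk guitar"]
with torch.no_grad():
    wavs = model.generate(descriptions)  # tensor of shape [batch, channels, samples]

for i, wav in enumerate(wavs):
    # Writes <i>.wav with loudness normalization.
    audio_write(str(i), wav.cpu(), model.sample_rate, strategy="loudness")
```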
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve the model's performance and its generalization ability.
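To make the architecture description concrete, the following is a small hypothetical sketch of a waveform encoder-decoder with skip connections in PyTorch. It only illustrates the general idea described above; the class name, layer sizes, and hyperparameters are invented and do not correspond to the actual model in this repository.

```python
# Hypothetical sketch of an encoder-decoder on raw waveform with skip
# connections, illustrating the idea described above (not the repo's model).
import torch
import torch.nn as nn

class TinyWaveformDenoiser(nn.Module):
    def __init__(self, hidden=48, depth=3, kernel=8, stride=4):
        super().__init__()
        self.encoder, self.decoder = nn.ModuleList(), nn.ModuleList()
        chin = 1
        for i in range(depth):
            chout = hidden * (2 ** i)
            self.encoder.append(nn.Sequential(
                nn.Conv1d(chin, chout, kernel, stride), nn.ReLU()))
            # decoder is built in reverse; the final layer has no ReLU
            self.decoder.insert(0, nn.Sequential(
                nn.ConvTranspose1d(chout, chin, kernel, stride),
                nn.ReLU() if i > 0 else nn.Identity()))
            chin = chout

    def forward(self, noisy):  # noisy: [batch, 1, samples]
        x, skips = noisy, []
        for enc in self.encoder:
            x = enc(x)
            skips.append(x)
        for dec in self.decoder:
            skip = skips.pop()
            x = x + skip[..., : x.shape[-1]]  # skip connection, trimmed to match lengths
            x = dec(x)
        return x  # estimated clean waveform (slightly shorter than the input without padding)

model = TinyWaveformDenoiser()
clean_est = model(torch.randn(1, 1, 16000))  # one second of audio at 16 kHz
```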
speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers, in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The method employs gated neural networks that are trained to separate the voices over multiple processing steps while keeping the speaker assigned to each output channel fixed. A separate model is trained for each possible number of speakers, and the model with the largest number of speakers is used to determine the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
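The selection step described above (running the largest model and counting the active output channels to pick the matching model) can be sketched as follows. This is a hypothetical illustration of that logic, not the repository's code; the energy threshold and the models_by_count mapping are invented for the example.

```python
# Hypothetical sketch of the speaker-count selection described above:
# run the model trained for the largest number of speakers, count the output
# channels that carry speech energy, then use the matching model.
import torch

def count_active_channels(separated, silence_db=-40.0):
    """separated: [num_speakers, samples]; a channel counts as active if its
    energy is above an (assumed) threshold relative to the loudest channel."""
    energy = separated.pow(2).mean(dim=-1)                      # per-channel power
    ref = energy.max().clamp(min=1e-12)
    rel_db = 10.0 * torch.log10(energy.clamp(min=1e-12) / ref)
    return int((rel_db > silence_db).sum())

def separate_unknown_num_speakers(mixture, models_by_count):
    """models_by_count: dict mapping speaker count -> separation model
    (each takes [1, samples] and returns [num_speakers, samples])."""
    largest = max(models_by_count)
    with torch.no_grad():
        rough = models_by_count[largest](mixture)               # over-separate first
    k = min(max(1, count_active_channels(rough)), largest)      # estimated speaker count
    with torch.no_grad():
        return models_by_count[k](mixture)                      # re-separate with the matching model
```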
textlesslib
Library for Textless Spoken Language Processing
adiyoss's Repositories
adiyoss/WatermarkNN
Watermarking Deep Neural Networks (USENIX 2018)
adiyoss/GCommandsPytorch
ConvNets for Audio Recognition using Google Commands Dataset
adiyoss/DeepAnomaly
Recurrent Neural Networks for Anomaly Detection using Time Series Data
adiyoss/DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
adiyoss/AutoVowelDuration
Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)
adiyoss/StructED
Risk Minimization Algorithms in Structured Prediction (JMLR 2016)
adiyoss/Chroma
Pitch and chroma implementation in Java
adiyoss/DeepVOT
Automatic Measurement of Voice Onset Time (VOT) using Deep Recurrent Neural Networks (Interspeech 2016)
adiyoss/InDepth-Analysis
Sentence Representation Analysis
adiyoss/colman_ml
ML course @ colman
adiyoss/DeepWDM
Recurrent Neural Networks for Word Duration Measurement
adiyoss/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve the model's performance and its generalization ability.
adiyoss/diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights in order to achieve a given trade-off between model size and accuracy.
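A minimal sketch of the pseudo-quantization-noise idea, under assumptions: during training, hard rounding is replaced by additive uniform noise of the same magnitude, so the bit width stays differentiable and can be traded off against a model-size penalty. The bit-width parameterization and penalty weight below are invented for illustration and are not DiffQ's actual implementation.

```python
# Hypothetical sketch of differentiable quantization with pseudo quantization
# noise: at train time, rounding is replaced by uniform noise of matching
# magnitude, so a learnable bit width can be optimized alongside a size penalty.
import torch

def pseudo_quantize(w, bits, training=True):
    """Uniform quantizer over the tensor's range with `bits` bits.
    During training, adds noise ~ U(-delta/2, delta/2) instead of rounding."""
    levels = 2.0 ** bits - 1.0
    lo, hi = w.min().detach(), w.max().detach()
    delta = (hi - lo).clamp(min=1e-8) / levels             # quantization step
    if training:
        noise = (torch.rand_like(w) - 0.5) * delta         # pseudo quantization noise
        return w + noise
    return torch.round((w - lo) / delta) * delta + lo      # hard quantization at inference

# Toy usage: learn a bit width that trades accuracy for model size.
w = torch.randn(256, 256)
log_bits = torch.tensor(3.0, requires_grad=True)           # assumed parameterization of the bit width
opt = torch.optim.Adam([log_bits], lr=0.05)
for _ in range(100):
    bits = torch.nn.functional.softplus(log_bits) + 2.0    # keep bits > 2
    wq = pseudo_quantize(w, bits)
    distortion = (wq - w).pow(2).mean()                    # stands in for the task loss
    size_penalty = 1e-2 * bits                             # proxy for bits per weight (assumed weight)
    loss = distortion + size_penalty
    opt.zero_grad()
    loss.backward()
    opt.step()
```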
adiyoss/Expresso
Expresso dataset demo page
adiyoss/Tools-to-Design-or-Visualize-Architecture-of-Neural-Network
Tools to Design or Visualize Architecture of Neural Network
adiyoss/adiyoss.github.io
Personal website
adiyoss/audio-cont
adiyoss/dataset
adiyoss/dotfiles
dotfiles for vim, tmux, etc.
adiyoss/dsVAE-NES
adiyoss/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
adiyoss/griffin_lim
Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.
adiyoss/iTerm2-Color-Schemes
Over 150 terminal color schemes/themes for iTerm/iTerm2 (with ports to Terminal, Konsole, PuTTY, Xresources, XRDB, and Terminator)
adiyoss/nltk_contrib
NLTK Contrib
adiyoss/OpenNMT
Open-Source Neural Machine Translation in Torch
adiyoss/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
adiyoss/pytorch-stft
An STFT/iSTFT for PyTorch.
adiyoss/StarGAN
PyTorch Implementation of StarGAN - CVPR 2018
adiyoss/turk
adiyoss/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit