Pinned Repositories
av_hubert
A self-supervised learning framework for audio-visual speech
DeepFilterNet
Noise supression using deep filtering
denoiser
(For 16kHz audio only) Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
pygsound
Impulse response generation based on state-of-the-art geometric sound propagation engine.
pyimagesource
Image-source method for room acoustics
RNNoise_Wrapper
A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for training.
OscarLiau's Repositories
OscarLiau/av_hubert
A self-supervised learning framework for audio-visual speech
OscarLiau/DeepFilterNet
Noise supression using deep filtering
OscarLiau/denoiser
(For 16kHz audio only) Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
OscarLiau/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
OscarLiau/ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
OscarLiau/FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
OscarLiau/Google-Voice-Separation-voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
OscarLiau/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
OscarLiau/pygsound
Impulse response generation based on state-of-the-art geometric sound propagation engine.
OscarLiau/pyimagesource
Image-source method for room acoustics
OscarLiau/RNNoise_Wrapper
A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for training.
OscarLiau/IIRNet
Direct design of biquad filter cascades with deep learning by sampling random polynomials.
OscarLiau/Learning_Neural_Acoustic_Fields
Official code for "Learning Neural Acoustic Fields"
OscarLiau/libsndfile
A C library for reading and writing sound files containing sampled audio data.
OscarLiau/makefiletutorial
Learn make by example
OscarLiau/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
OscarLiau/RIR-Generator
Generating room impulse responses
OscarLiau/sparsenet
OscarLiau/SpeechDenoisingWithDeepFeatureLosses
Speech Denoising with Deep Feature Losses
OscarLiau/tea-lab-web
Telecom Electroacoustic Audio Lab Website
OscarLiau/wayverb
This project is not under active development. Hybrid waveguide and raytracer for room acoustics on the GPU
OscarLiau/youtube-dl
Command-line program to download videos from YouTube.com and other video sites