LuMiaMia's Stars
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
librosa/librosa
Python library for audio and music analysis
affinelayer/pix2pix-tensorflow
Tensorflow port of Image-to-Image Translation with Conditional Adversarial Nets https://phillipi.github.io/pix2pix/
xiph/rnnoise
Recurrent neural network for audio noise reduction
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pykaldi/pykaldi
A Python wrapper for Kaldi
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
introlab/odas
ODAS: Open embeddeD Audition System
acoular/acoular
Acoular - Acoustic testing and source mapping software
ehabets/RIR-Generator
Generating room impulse responses
shichaog/WebRTC-audio-processing
webrtc audio processing
posenhuang/deeplearningsourceseparation
Deep Recurrent Neural Networks for Source Separation
xanguera/BeamformIt
BeamformIt acoustic beamforming software
Baidu-AIP/speech-vad-demo
集成Webrtc的VAD,用于切分音频文件
yongxuUSTC/sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
fgnt/nn-gev
Neural network supported GEV beamformer
helianvine/fdndlp
A speech dereverberation algorithm, also called wpe
funcwj/CGMM-MVDR
Implementation of the CGMM-MVDR beamforming (for python version please refer to https://github.com/funcwj/setk)
DistantSpeechRecognition/mcse
Multi-channel speech enhancement system (MVDR beamformer + several postfilters)
jcsilva/deep-clustering
xuchenglin28/WSCM-MUSIC
Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source
jacoxu/ASAM
This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]
LCAV/AcousticRakeReceiver
The acoustic rake receiver, a microphone beamformer that uses echoes to improve the noise and interference suppression. Python code to reproduce all the results from Raking the Cocktail Party by Ivan Dokmanic, Robin Scheibler, and Martin Vetterli.
ronw/matlab_htk
MATLAB functions that interface with the HTK Speech Recognition Toolkit (http://htk.eng.cam.ac.uk/) for training HMMs, GMMs and simple speech recognizers.
ehabets/ANF-Generator
Generating non-stationary multi-sensor signals under a spatial coherence constraint
ehabets/INF-Generator
Generating sensor signals in isotropic noise fields
jameslyons/matlab_speech_features
A set of speech feature extraction functions for ASR and speaker identification written in matlab.
bscharan/Automatic-speech-sequence-segmentation
The Main Aim of this project is to segment and cluster an audio sample based on speaker when number of speakers are not known before hand. Main challenge in the process of speaker recognition is separting audio based on speaker.It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker's true identity.Other challenges are due to multiple speakers present at the time instant
adiyoss/DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)