LuMiaMia

LuMiaMia's Stars

kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.1k 695 1.6k5.3k
librosa/librosa
Python library for audio and music analysis
Language:Python7k 136 1.2k956
affinelayer/pix2pix-tensorflow
Tensorflow port of Image-to-Image Translation with Conditional Adversarial Nets https://phillipi.github.io/pix2pix/
Language:JavaScript5.1k 179 1851.3k
xiph/rnnoise
Recurrent neural network for audio noise reduction
Language:C4k 149 198889
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
Language:Python2.4k 88 71618
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language:Python1.4k 44 223424
pykaldi/pykaldi
A Python wrapper for Kaldi
Language:Python991 42 277248
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Language:MATLAB835 45 40232
introlab/odas
ODAS: Open embeddeD Audition System
Language:C778 55 260247
acoular/acoular
Acoular - Acoustic testing and source mapping software
Language:Python431 28 204123
ehabets/RIR-Generator
Generating room impulse responses
Language:C++416 18 8145
shichaog/WebRTC-audio-processing
webrtc audio processing
Language:C++375 24 5136
posenhuang/deeplearningsourceseparation
Deep Recurrent Neural Networks for Source Separation
Language:MATLAB365 33 25135
xanguera/BeamformIt
BeamformIt acoustic beamforming software
Language:C++342 33 22111
Baidu-AIP/speech-vad-demo
集成Webrtc的VAD，用于切分音频文件
Language:C336 18 16125
yongxuUSTC/sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
Language:Python333 19 58125
fgnt/nn-gev
Neural network supported GEV beamformer
Language:Python193 14 1490
helianvine/fdndlp
A speech dereverberation algorithm, also called wpe
Language:Python147 11 459
funcwj/CGMM-MVDR
Implementation of the CGMM-MVDR beamforming (for python version please refer to https://github.com/funcwj/setk)
Language:Python139 7 355
DistantSpeechRecognition/mcse
Multi-channel speech enhancement system (MVDR beamformer + several postfilters)
Language:Python99 10 153
jcsilva/deep-clustering
Language:Python71 12 335
xuchenglin28/WSCM-MUSIC
Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source
Language:MATLAB64 4 023
jacoxu/ASAM
This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]
Language:Python55 5 220
LCAV/AcousticRakeReceiver
The acoustic rake receiver, a microphone beamformer that uses echoes to improve the noise and interference suppression. Python code to reproduce all the results from Raking the Cocktail Party by Ivan Dokmanic, Robin Scheibler, and Martin Vetterli.
Language:Python53 17 018
ronw/matlab_htk
MATLAB functions that interface with the HTK Speech Recognition Toolkit (http://htk.eng.cam.ac.uk/) for training HMMs, GMMs and simple speech recognizers.
Language:Matlab45 8 118
ehabets/ANF-Generator
Generating non-stationary multi-sensor signals under a spatial coherence constraint
Language:MATLAB43 2 313
ehabets/INF-Generator
Generating sensor signals in isotropic noise fields
Language:MATLAB43 1 221
jameslyons/matlab_speech_features
A set of speech feature extraction functions for ASR and speaker identification written in matlab.
Language:Matlab43 8 032
bscharan/Automatic-speech-sequence-segmentation
The Main Aim of this project is to segment and cluster an audio sample based on speaker when number of speakers are not known before hand. Main challenge in the process of speaker recognition is separting audio based on speaker.It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker's true identity.Other challenges are due to multiple speakers present at the time instant
Language:MATLAB23 2 07
adiyoss/DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
Language:Python17 5 03