Pinned Repositories
advisor
Open-source implementation of Google Vizier for hyper parameters tuning
AEC-Challenge
AEC Challenge
AIR-ASVspoof
Implementation of the paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
ALICE
Automatic LInguistic Unit Count Estimator (ALICE)
ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
ASV-anti-spoofing-with-Res2Net
Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture https://arxiv.org/abs/2010.15006
asvspoof2019
Our submission to the ASVspoof 2019: Automatic Speaker Verification Spoofing and Countermeasures Challenge
Audio-Classification-Models
Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.
Baby-Crying-Detection-Based-on-Audio-and-Video-Fusion
This is the dataset set and code of paper which name is Research of Infant Crying Detection Method Based on Audio and Video Fusion
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
dongsig's Repositories
dongsig/Build-SE-Dataset
Build speech enhancement dataset.
dongsig/CGMM-MVDR
Implementation of the CGMM-MVDR beamforming
dongsig/GMM1D
A simple example of one-dimensional Gaussian mixture model
dongsig/IRAPT
Instantaneous pitch estimation based on RAPT framework (EUSIPCO-2012)
dongsig/keras-serving
bring keras-models to production with tensorflow-serving and nodejs + docker :pizza:
dongsig/KWS-1
Keyword Spotting for detecting a word in an audio file
dongsig/music-auto_tagging-keras
Music auto-tagging models and trained weights in keras/theano
dongsig/nn-vad
simple dnn based vad
dongsig/NNAEC-NeuralNetworkbasedAcousticEchoCancellation
NNAEC-Neural Network based Acoustic Echo Cancellation
dongsig/Quality-Net
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)
dongsig/rnnoise
Recurrent neural network for audio noise reduction
dongsig/Speech-enhancement
Deep neural network based speech enhancement toolkit
dongsig/Speech-keyword-verification
Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings
dongsig/Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
dongsig/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
dongsig/VAD-python
Voice Activity Detector in Python
dongsig/VisemeNet_tensorflow
dongsig/voiceProfile-for-gender-age-classify
Two Keras models for child/adult & man/woman classify use speech in Python.