xiangkanghuang's Stars
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
pranaymanocha/PerceptualAudio
Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM
adrienchaton/PerceptualAudio_Pytorch
Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - unofficial work in progress
xiph/LPCNet
Efficient neural speech synthesis
funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
facebookresearch/CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
AberHu/Knowledge-Distillation-Zoo
Pytorch implementation of various Knowledge Distillation (KD) methods.
FLHonker/Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。
haitongli/knowledge-distillation-pytorch
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
dkozlov/awesome-knowledge-distillation
Awesome Knowledge Distillation
JusperLee/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
cywang97/StreamingTransformer
cpuimage/WebRTC_NS_CPP
Noise Suppression Module Port From WebRTC
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
archiki/Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 2021.
santi-pdp/pase
Problem Agnostic Speech Encoder
SuperAI211/Realtime_AudioDenoise_EchoCancellation
breizhn/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
DeepVAC/deepvac
PyTorch Project Specification.
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
madhavmk/Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.
GregorR/rnnoise-models
Trained neural networks and requisite information and data for rnnoise-nu
ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
athena-team/athena-signal
breizhn/DTLN-aec
This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.
rogalmic/vscode-bash-debug
Bash shell debugger extension for VSCode (based on bashdb)
awni/warp-ctc
Fast parallel CTC.