xiangkanghuang

xiangkanghuang's Stars

wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Language:C2k406
pranaymanocha/PerceptualAudio
Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM
Language:Python35033
adrienchaton/PerceptualAudio_Pytorch
Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - unofficial work in progress
Language:Python602
xiph/LPCNet
Efficient neural speech synthesis
Language:C1.1k295
funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
Language:Python39592
facebookresearch/CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
Language:Python34762
AberHu/Knowledge-Distillation-Zoo
Pytorch implementation of various Knowledge Distillation (KD) methods.
Language:Python1.6k265
FLHonker/Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。
2.5k335
haitongli/knowledge-distillation-pytorch
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
Language:Python1.8k342
dkozlov/awesome-knowledge-distillation
Awesome Knowledge Distillation
3.4k493
JusperLee/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
Language:Python41576
cywang97/StreamingTransformer
Language:Python27242
cpuimage/WebRTC_NS_CPP
Noise Suppression Module Port From WebRTC
Language:C5931
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Language:Cuda47991
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6k756
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.6k1.4k
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB706149
archiki/Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 2021.
Language:Python4610
santi-pdp/pase
Problem Agnostic Speech Encoder
Language:Python43987
SuperAI211/Realtime_AudioDenoise_EchoCancellation
Language:C++12025
breizhn/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Language:Python567160
DeepVAC/deepvac
PyTorch Project Specification.
Language:Python660104
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Language:Python537153
madhavmk/Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.
Language:Jupyter Notebook17242
GregorR/rnnoise-models
Trained neural networks and requisite information and data for rnnoise-nu
Language:C25140
ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
Language:C52097
athena-team/athena-signal
Language:C512193
breizhn/DTLN-aec
This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.
Language:Python26970
rogalmic/vscode-bash-debug
Bash shell debugger extension for VSCode (based on bashdb)
Language:Shell21726
awni/warp-ctc
Fast parallel CTC.
Language:Cuda3112