WenzheLiu-Speech
Hi, I am Wenzhe Liu. I work for Kuaishou, and was employed by Tencent. focusing on generalized speech enhancement, audio codec and speech synthesis
TencentBeijing, China
Pinned Repositories
aac-datasets
Audio Captioning datasets for PyTorch.
ADSP_Tutorials
Advanced Signal Processing Notebooks and Tutorials
ai-audio-datasets
AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
penguins-aicodec-demo
pyaec
simple and efficient python implemention of a series of adaptive filters (lms、nlms、rls、kalman、Frequency Domain Adaptive Filter、Partitioned-Block-Based Frequency Domain Adaptive Filter、Frequency Domain Kalman Filter、Partitioned-Block-Based Frequency Domain Kalman Filter) for acoustic echo cancellation.
Realtime_AudioDenoise_EchoCancellation
sound-source-localization-algorithm_DOA_estimation
关于语音信号声源定位DOA估计所用的一些传统算法
The-guidebook-of-speech-enhancement
wenzheliu-speech
WenzheLiu-Speech's Repositories
WenzheLiu-Speech/sound-source-localization-algorithm_DOA_estimation
关于语音信号声源定位DOA估计所用的一些传统算法
WenzheLiu-Speech/Realtime_AudioDenoise_EchoCancellation
WenzheLiu-Speech/ADSP_Tutorials
Advanced Signal Processing Notebooks and Tutorials
WenzheLiu-Speech/pyaec
simple and efficient python implemention of a series of adaptive filters (lms、nlms、rls、kalman、Frequency Domain Adaptive Filter、Partitioned-Block-Based Frequency Domain Adaptive Filter、Frequency Domain Kalman Filter、Partitioned-Block-Based Frequency Domain Kalman Filter) for acoustic echo cancellation.
WenzheLiu-Speech/ILRMA
MATLAB script of Independent Low-Rank Matrix Analysis (ILRMA)
WenzheLiu-Speech/JAECBF
WenzheLiu-Speech/deepfilter_implement_see_-networks-speakerfilter.py
Wenzhe Liu Notes: deep filter reproduction, see: 23_3090_speakerfilter_new_deepfilter_final_1024_new/networks/speakerfilter.py i.e. https://github.com/heshulin/23_3090_speakerfilter_new_deepfilter_final_1024_new/blob/86dd75cb9f7858b11e8adc0097da372f706c23a1/networks/speakerfilter.py#L103
WenzheLiu-Speech/DNS-Challenge-IACASlab9.github.io
WenzheLiu-Speech/eGeMAPS_estimator
WenzheLiu-Speech/Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
WenzheLiu-Speech/TFGAN-PLC
A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission
WenzheLiu-Speech/TinyNeuralNetwork
WenzheLiu-Speech/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
WenzheLiu-Speech/AudioCodingTutorials
Audio Coding Notebooks and Tutorials
WenzheLiu-Speech/clarity_CEC1
1st Clarity Enhancement Challenge
WenzheLiu-Speech/COSPA
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement
WenzheLiu-Speech/Dialog_Corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
WenzheLiu-Speech/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
WenzheLiu-Speech/Maximilian
C++ Audio and Music DSP Library
WenzheLiu-Speech/Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement
This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.
WenzheLiu-Speech/multi_quantization
WenzheLiu-Speech/NELE-GAN
Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
WenzheLiu-Speech/pam-nac
Psychoacoustic Calibration for Efficient Neural Audio Coding
WenzheLiu-Speech/PercepNet
(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
WenzheLiu-Speech/Percepnet-Keras
percepnet implemented using Keras, still need to be optimized and tuned.
WenzheLiu-Speech/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
WenzheLiu-Speech/pytorch_complex
A temporal module for PyTorch-ComplexTensor
WenzheLiu-Speech/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
WenzheLiu-Speech/vector-quantize-pytorch
Vector Quantization, in Pytorch
WenzheLiu-Speech/Video_Conference_Enhancer
A software that supports real time video&audio processing for meeting application.