Pzhang266
Universal Audio Processing (denoising, source separation, dereverberation ...)
Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing, China
Pzhang266's Stars
coder2gwy/coder2gwy
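The internet's first guide to the civil service exam for programmers, jointly presented by three former big-tech programmers who have since joined the public sector.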
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
facebookresearch/VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
aispeech-lab/advr-avss
PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
huyanxin/DeepComplexCRN
fakufaku/fast_bss_eval
A fast implementation of bss_eval metrics for blind source separation
zuoqing1988/ZQCNN
An inference framework that also comes with many useful demos; please star it if you find it useful.
fgnt/ci_sdr
DingXiaoH/ResRep
ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)
pathak22/pyflow
Fast, accurate and easy to run dense optical flow with python wrapper
Pzhang266/Optical-Flow-Guided-Feature
Implementation Code of the paper Optical Flow Guided Feature, CVPR 2018
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
Pzhang266/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
F-Tag/python-vad
py-webrtcvad wrapper for trimming speech clips
aispeech-lab/oavss
PyTorch implementation of the paper: "Online Audio-Visual Speech Separation with Generative Adversarial Training"
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
dqqcasia/awesome-speech-translation
timesler/facenet-pytorch
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch
The state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
ujscjj/DPTNet
aispeech-lab/WASE
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments"
haoyz/sym-STDP-SNN
Code for the model presented in the paper "A Biologically Plausible Supervised Learning Method for Spiking Neural Networks Using the Symmetric STDP Rule"
maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Alexander-H-Liu/End-to-end-ASR-Pytorch
This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
espnet/espnet
End-to-End Speech Processing Toolkit
JusperLee/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on the Google paper