Pzhang266

Universal Audio Processing (denoise, source separation, dereverbration ...)

Institute of Automation Chinese Academy of Sciences (CASIA)China Beijing

Pzhang266's Stars

coder2gwy/coder2gwy
互联网首份程序员考公指南，由3位已经进入体制内的前大厂程序员联合献上。
26.5k3.7k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.3k6.4k
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.2k485
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Language:Python30868
facebookresearch/VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
Language:Python22035
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.2k421
aispeech-lab/advr-avss
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
Language:Python174
huyanxin/DeepComplexCRN
Language:HTML39697
fakufaku/fast_bss_eval
A fast implementation of bss_eval metrics for blind source separation
Language:Python1308
zuoqing1988/ZQCNN
一款推理框架，同时有很多有用的demo，觉得好用请点星啊
Language:C2.2k508
fgnt/ci_sdr
Language:Python528
DingXiaoH/ResRep
ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)
Language:Python28736
pathak22/pyflow
Fast, accurate and easy to run dense optical flow with python wrapper
Language:C++648139
Pzhang266/Optical-Flow-Guided-Feature
Implementation Code of the paper Optical Flow Guided Feature, CVPR 2018
1
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
Language:C++13.5k3.4k
Pzhang266/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
1
F-Tag/python-vad
py-webrtcvad wrapper for trimming speech clips
Language:Python4721
aispeech-lab/oavss
Pytorch implementation of the paper: "Online Audio-Visual Speech Separation with Generative Adversarial Training"
82
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.7k1.4k
dqqcasia/awesome-speech-translation
1751
timesler/facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Language:Python4.5k945
VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch
The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
Language:Python20951
ujscjj/DPTNet
Language:Python10524
aispeech-lab/WASE
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments"
245
haoyz/sym-STDP-SNN
Code for the model presented in the paper "A Biologically Plausible Supervised Learning Method for Spiking Neural Networks Using the Symmetric STDP Rule"
Language:C++4612
maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Language:Python1.1k226
Alexander-H-Liu/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
Language:Python1.2k318
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.4k2.2k
JusperLee/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
Language:Python16444

Pzhang266

Pzhang266's Stars

coder2gwy/coder2gwy

facebookresearch/fairseq

s3prl/s3prl

TaoRuijie/TalkNet-ASD

facebookresearch/VisualVoice

asteroid-team/asteroid

aispeech-lab/advr-avss

huyanxin/DeepComplexCRN

fakufaku/fast_bss_eval

zuoqing1988/ZQCNN

fgnt/ci_sdr

DingXiaoH/ResRep

pathak22/pyflow

Pzhang266/Optical-Flow-Guided-Feature

davisking/dlib

Pzhang266/avobjects

F-Tag/python-vad

aispeech-lab/oavss

speechbrain/speechbrain

dqqcasia/awesome-speech-translation

timesler/facenet-pytorch

VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch

ujscjj/DPTNet

aispeech-lab/WASE

haoyz/sym-STDP-SNN

maum-ai/voicefilter

Alexander-H-Liu/End-to-end-ASR-Pytorch

espnet/espnet

JusperLee/Looking-to-Listen-at-the-Cocktail-Party