zexupan
Algorithm engineer @ AlibabaGroup; Visiting research scientist @ MERL; PhD @ NUS. Working on speech extraction and multimedia.
National University of SingaporeSingapore
zexupan's Stars
xcmyz/FastSpeech
The Implementation of FastSpeech based on pytorch.
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
jefflai108/Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
mpariente/pystoi
Python implementation of the Short Term Objective Intelligibility measure
clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
Jungjee/RawNet
Official repository for RawNet, RawNet2, and RawNet3
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
smeetrs/deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
fatchord/WaveRNN
WaveRNN Vocoder + TTS
JusperLee/Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
gemengtju/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
foorenxiang/OHR400Dashboard
UAV Flight Analysis and ML-powered Rolling Launch Control System. Written in Python and q/kdb+. Deployed at:
kaituoxu/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
lingtengqiu/Facial_Expression_Similarity
This project aims at providing a fast, modular reference implementation for A Compact Embedding for Facial Expression Similarity models using PyTorch.
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
JusperLee/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles