zexupan

Algorithm engineer @ AlibabaGroup; Visiting research scientist @ MERL; PhD @ NUS. Working on speech extraction and multimedia.

National University of SingaporeSingapore

zexupan's Stars

xcmyz/FastSpeech
The Implementation of FastSpeech based on pytorch.
Language:Python861213
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1.9k544
jefflai108/Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
Language:Python483100
mpariente/pystoi
Python implementation of the Short Term Objective Intelligibility measure
Language:MATLAB32959
clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
Language:Python1.1k274
Jungjee/RawNet
Official repository for RawNet, RawNet2, and RawNet3
Language:Python36353
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB729151
ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
Language:C54499
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
Language:Jupyter Notebook31234
smeetrs/deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Language:Python21741
fatchord/WaveRNN
WaveRNN Vocoder + TTS
Language:Python2.1k698
JusperLee/Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
Language:Python42766
gemengtju/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Language:MATLAB45295
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
Language:Python398102
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.3k424
foorenxiang/OHR400Dashboard
UAV Flight Analysis and ML-powered Rolling Launch Control System. Written in Python and q/kdb+. Deployed at:
Language:Python51
kaituoxu/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Language:Python687156
lingtengqiu/Facial_Expression_Similarity
This project aims at providing a fast, modular reference implementation for A Compact Embedding for Facial Expression Similarity models using PyTorch.
Language:Python182
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
764137
JusperLee/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
Language:Python16545

zexupan

zexupan's Stars

xcmyz/FastSpeech

ming024/FastSpeech2

jefflai108/Contrastive-Predictive-Coding-PyTorch

mpariente/pystoi

clovaai/voxceleb_trainer

Jungjee/RawNet

nanahou/Awesome-Speech-Enhancement

ludlows/PESQ

etzinis/sudo_rm_rf

smeetrs/deep_avsr

fatchord/WaveRNN

JusperLee/Dual-Path-RNN-Pytorch

gemengtju/Tutorial_Separation

mpc001/Lipreading_using_Temporal_Convolutional_Networks

asteroid-team/asteroid

foorenxiang/OHR400Dashboard

kaituoxu/Conv-TasNet

lingtengqiu/Facial_Expression_Similarity

JusperLee/Speech-Separation-Paper-Tutorial

JusperLee/Looking-to-Listen-at-the-Cocktail-Party