wnhsu

Research Scientist @ Facebook AI Research (FAIR). Former PhD Student @ MIT Spoken Language Systems Group

Pinned Repositories

FactorizedHierarchicalVAE
This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"
Language: Python | 153 7 827
PGLSTM_ASR
This repo contains code to reproduce the core results of "A Prioritized Grid Long Short-Term Memory RNN for Speech Recognition"
Language: Shell | 3 1 00
ResDAVEnet-VQ
Official code for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"
Language: Jupyter Notebook | 26 2 09
ReVISE
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Language: HTML | 13 3 10
ScalableFHVAE
This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders"
Language: Python | 52 5 519
semi-supervised-pytorch
Implementations of different VAE-based semi-supervised and generative models in PyTorch
Language: Python | 3 2 01
SpeechVAE
This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".
Language: Python | 52 6 78
tacotron2_dev
Language: Jupyter Notebook | 1 1 00
tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
Language: Python | 2 1 00
wavenet_vocoder
WaveNet vocoder
Language: Python | 1 2 00

wnhsu's Repositories

wnhsu/FactorizedHierarchicalVAE
This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"
Language: Python | 153 7 827
wnhsu/ScalableFHVAE
This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders"
Language: Python | 52 5 519
wnhsu/SpeechVAE
This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".
Language: Python | 52 6 78
wnhsu/ResDAVEnet-VQ
Official code for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"
Language: Jupyter Notebook | 26 2 09
wnhsu/ReVISE
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Language: HTML | 13 3 10
wnhsu/PGLSTM_ASR
This repo contains code to reproduce the core results of "A Prioritized Grid Long Short-Term Memory RNN for Speech Recognition"
Language: Shell | 3 1 00
wnhsu/semi-supervised-pytorch
Implementations of different VAE-based semi-supervised and generative models in PyTorch
Language: Python | 3 2 01
wnhsu/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
Language: Python | 2 1 00
wnhsu/tacotron2_dev
Language: Jupyter Notebook | 1 1 00
wnhsu/wavenet_vocoder
WaveNet vocoder
Language: Python | 1 2 00
wnhsu/ZeroSpeech2019_RLE_eval
ZeroSpeech 2019 evaluation with run-length encoding (RLE), metrics reported in ResDAVEnet-VQ.
Language: Python | 1 3 00
wnhsu/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Language: Python | 2 0
wnhsu/ABXpy
ABX discrimination task in Python
Language: Python | 1 0
wnhsu/CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
Language: C++ | 1 0
wnhsu/einops
Deep learning operations reinvented (for pytorch, tensorflow, jax and others)
Language: Python | 0 0
wnhsu/espnet_tts_frontend
Text frontend for ESPnet tts recipes
Language: Python | 0 0
wnhsu/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python | 1 0
wnhsu/image-to-speech-demo
2 0
wnhsu/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language: Shell | 1 0
wnhsu/show-attend-and-tell
TensorFlow Implementation of "Show, Attend and Tell"
Language: Jupyter Notebook | 1 0
wnhsu/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Language: C++ | 2 0
wnhsu/wnhsu.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language: JavaScript | 0 01
