joannahong
Research Scientist Intern @ Meta | Ph.D. candidate in Electrical Engineering @ KAIST
Redmond, WA
Pinned Repositories
AV-RelScore
Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" in CVPR23
av_hubert
A self-supervised learning framework for audio-visual speech
DiffV2S
espnet
End-to-End Speech Processing Toolkit
Face-Tells-Detailed-Expression
Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units
Face-Tells-Detailed-Expression-Dataset
Text-based dataset with comprehensive facial expression sentence
Lip-to-Speech-Synthesis-in-the-Wild
A video demo of IEEE International Conference on Acoustics, Speech and Signal Processing submitted paper titled "Lip-to-Speech Synthesis in the Wild with Multi-task Learning"
Lip2Wav-pytorch
a PyTorch implementation of Lip2Wav
Speech-Reconstruction-with-Reminiscent-Sound-via-Visual-Voice-Memory
Demo of IEEE TASLP submitted paper titled "Speech Reconstruction with Reminiscent Sound via Visual Voice Memory"
Visagesyntalk
The video demo of ECCV2022 paper titled "Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection"
joannahong's Repositories
joannahong/Lip2Wav-pytorch
a PyTorch implementation of Lip2Wav
joannahong/AV-RelScore
Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" in CVPR23
joannahong/DiffV2S
joannahong/Face-Tells-Detailed-Expression-Dataset
Text-based dataset with comprehensive facial expression sentence
joannahong/Lip-to-Speech-Synthesis-in-the-Wild
A video demo of IEEE International Conference on Acoustics, Speech and Signal Processing submitted paper titled "Lip-to-Speech Synthesis in the Wild with Multi-task Learning"
joannahong/Speech-Reconstruction-with-Reminiscent-Sound-via-Visual-Voice-Memory
Demo of IEEE TASLP submitted paper titled "Speech Reconstruction with Reminiscent Sound via Visual Voice Memory"
joannahong/Face-Tells-Detailed-Expression
Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units
joannahong/Visagesyntalk
The video demo of ECCV2022 paper titled "Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection"
joannahong/av_hubert
A self-supervised learning framework for audio-visual speech
joannahong/espnet
End-to-End Speech Processing Toolkit
joannahong/joannahong.github.io
joannahong/modern-resume-theme
A modern static resume template and theme. Powered by Jekyll and GitHub pages.
joannahong/Shine-Theme
FREE Bootstrap 5 Light Mode Resume/CV Template for Developers
joannahong/Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages