Pinned Repositories
CNMT
code for Confidence-aware Non-repetitive Multimodal Transformers for TextCaps (AAAI 2021)
compound-word-transformer
Official implementation of compound word transformer (AAAI'21)
Decompose-Single-Image-Into-Layers
DeepLearningTutorials
Deep Learning Tutorial notes and code. See the wiki for more info.
dominant-colors-of-video
Detecting the dominant colors frame by frame of a video and outputs an image to visualize them
Foley-Music
PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "
FS2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
GraduationProject_Undergraduate_ClaraZhang
This repository is built for Luwen Zhang's graduation project 《Implementation of personalized home decoration platform based on augmented reality》.
ShadowsocksX-NG
Next Generation of ShadowsocksX
youtube-8m
Starter code for working with the YouTube-8M dataset.
clarazwen's Repositories
clarazwen/GraduationProject_Undergraduate_ClaraZhang
This repository is built for Luwen Zhang's graduation project 《Implementation of personalized home decoration platform based on augmented reality》.
clarazwen/CNMT
code for Confidence-aware Non-repetitive Multimodal Transformers for TextCaps (AAAI 2021)
clarazwen/compound-word-transformer
Official implementation of compound word transformer (AAAI'21)
clarazwen/DeepLearningTutorials
Deep Learning Tutorial notes and code. See the wiki for more info.
clarazwen/Foley-Music
PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "
clarazwen/FS2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
clarazwen/ShadowsocksX-NG
Next Generation of ShadowsocksX
clarazwen/youtube-8m
Starter code for working with the YouTube-8M dataset.
clarazwen/FS2-ming024
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
clarazwen/HomePage
clarazwen/HomepageCAPT
clarazwen/maibiao
miniprogram
clarazwen/MELD
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
clarazwen/MEmoR
Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.
clarazwen/movienet-tools
Tools for movie and video research
clarazwen/moviesColorFingerprint
These scripts generate the "color fingerprint" of a movie by clustering the dominant colors through time.
clarazwen/MovieSynopsisAssociation
Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019
clarazwen/MRCMV
The release of the HoK400 and CFM400 datasets.
clarazwen/Music-Emotion-Recognition
A Machine Learning Approach of Emotional Model
clarazwen/MUStARD
Multimodal Sarcasm Detection Dataset
clarazwen/paper-reading
深度学习经典、新论文逐段精读
clarazwen/pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
clarazwen/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
clarazwen/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
clarazwen/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
clarazwen/tflearn
Deep learning library featuring a higher-level API for TensorFlow.
clarazwen/UR-FUNNY
This repository presents UR-FUNNY dataset: first dataset for multimodal humor detection
clarazwen/VAANet
Official implementation of VAANet for Emotion Recognition (AAAI2020)
clarazwen/video-bgm-generation
Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)
clarazwen/youtube-dl
Command-line program to download videos from YouTube.com and other video sites