Pinned Repositories
CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
find_fallen_objects
Official implementation of CVPR 2022 paper "Finding Fallen Objects Via Asynchronous Audio-Visual Integration".
Foley-Music
PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos"
GAT
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
imageqa-san
Code for Stacked Attention Networks for Image Question Answering
OPEn
rnn
Recurrent Neural Network library for Torch7's nn
SCN_for_video_captioning
Using Semantic Compositional Networks for Video Captioning
tdw-transport-challenge
tdw-transport-challenge-starter-code
chuangg's Repositories
chuangg/CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
chuangg/Foley-Music
PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos"
chuangg/tdw-transport-challenge-starter-code
chuangg/find_fallen_objects
Official implementation of CVPR 2022 paper "Finding Fallen Objects Via Asynchronous Audio-Visual Integration".
chuangg/OPEn
chuangg/tdw-transport-challenge
chuangg/GAT
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
chuangg/imageqa-san
Code for Stacked Attention Networks for Image Question Answering
chuangg/rnn
Recurrent Neural Network library for Torch7's nn
chuangg/SCN_for_video_captioning
Using Semantic Compositional Networks for Video Captioning
chuangg/Semantic_Compositional_Nets
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
chuangg/SfMLearner
An unsupervised learning framework for depth and ego-motion estimation from monocular videos
chuangg/stylenet-1
A PyTorch implementation of "StyleNet: Generating Attractive Visual Captions with Styles"
chuangg/TensorFlow-Tutorials
Simple tutorials using Google's TensorFlow Framework
chuangg/tvnet
End-to-End Learning of Motion Representation for Video Understanding
chuangg/Youtube-8M
PaddlePaddle models for Youtube-8M Video Understanding Challenge
chuangg/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
chuangg/sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
chuangg/StyleNet
chuangg/vqs
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation