thaolmk54

Computer Vision, Visual Reasoning, Machine Learning, Machine Reasoning. @A2I2, Deakin University, Australia.

Pinned Repositories

Advanced-System-Software
Source code for Music Player demo
Language: C++ 0 1 00
anticipatr
Language: Python 0 0 00
bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
Language: Jupyter Notebook 0 0 00
consistent_gqa
Train a consistent general question and answer machine
Language: Python 0 0 00
CRAFT-Box2D
The simulator that we used to render 2D physics simulations for CRAFT dataset generation.
Language: C 0 0 00
DensePose
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
Language: Jupyter Notebook 0 0 00
EDABK1MakeFriendEasily
QR code encoding and decoding to exchange contact information
Language: Java 0 1 00
graph-primal-dual
Language: Python 0 0 00
hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
Language: Python 130 7 1926
LOGNet-VQA
Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)
Language: Python 13 2 04

thaolmk54's Repositories

thaolmk54/hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
Language: Python 130 7 1926
thaolmk54/LOGNet-VQA
Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)
Language: Python 13 2 04
thaolmk54/Advanced-System-Software
Source code for Music Player demo
Language: C++ 0 1 00
thaolmk54/anticipatr
Language: Python 0 0 00
thaolmk54/bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
Language: Jupyter Notebook 0 0 00
thaolmk54/consistent_gqa
Train a consistent general question and answer machine
Language: Python 0 0 00
thaolmk54/CRAFT-Box2D
The simulator that we used to render 2D physics simulations for CRAFT dataset generation.
Language: C 0 0 00
thaolmk54/DensePose
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
Language: Jupyter Notebook 0 0 00
thaolmk54/EDABK1MakeFriendEasily
QR code encoding and decoding to exchange contact information
Language: Java 0 1 00
thaolmk54/graph-primal-dual
Language: Python 0 0 00
thaolmk54/hcrn-long-short-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Long–Short-Form Video Question Answering"
0 1 00
thaolmk54/thaolmk54.github.io
Thao Minh Le's personal site
Language: CSS 0 1 00
thaolmk54/VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures", "Learning to count objects", and "Bottom-up top-down" for Visual Question Answering 2.0
Language: Python 0 0 00
thaolmk54/VideoPose3D
Efficient 3D human pose estimation in video using 2D keypoint trajectories
thaolmk54/VSU-Dataset
ARVSU dataset - Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances
