Pinned Repositories
201700202116zhangzongmeng
2D-TAN
AAAI‘20 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
bottom-up-attention-vqa
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
MLVQA
multitask learning for visual question answering via intra- and inter-modality modeling
multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
RecNet
A Pytorch implementation of "reconstruction network for video captioning", CVPR 2018
video-caption.pytorch-1
video-feature-extraction
Scripts for feature extraction of video using opencv & pytorch
What-I-Have-Read
Papers and Slides......Focus on NLP
AmmieQi's Repositories
AmmieQi/MLVQA
multitask learning for visual question answering via intra- and inter-modality modeling
AmmieQi/Cross-Attention-VizWiz-VQA
AmmieQi/CVPR21-Cogrounding_semantic_attention
AmmieQi/errormap_atteneval
Code for the paper: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
AmmieQi/GCoMem-NN4VideoQA
Implementation for the journal paper "Graph-enhanced Collaborative Memory Network for Video Question Answering" (Jianyu et al., IEEE Transactions on Multimedia (TMM. 2021)
AmmieQi/how-KG-ATT-help
Code and dataset for ACL 2021 paper "How Knowledge Graph and Attention Help?A Quantitative Analysis into Bag-level Relation Extraction".
AmmieQi/HyperShroudX
AmmieQi/ICCV2021-Papers-with-Code
ICCV 2021 论文和开源项目合集
AmmieQi/IROP_PosteriorPole_QualityAssurance
Performs Posterior Pole + Quality Exclusion for IROP
AmmieQi/ivadomed
Repository on the collaborative IVADO medical imaging project between the Mila and NeuroPoly labs.
AmmieQi/Knowledge-Projection-for-ERE
Source codes for #ACL2021 paper "From Discourse to Narrative: Knowledge Projection for Event Relation Extraction"
AmmieQi/LGSearch_DDGAN_PyTorch
Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)
AmmieQi/LMC-Memory
PyTorch Implementation of LMC-Memory (CVPR 2021 Oral)
AmmieQi/MASA-SR
MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution (CVPR2021)
AmmieQi/MASN-pytorch
pytorch implementation for the paper Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
AmmieQi/meta-analysis-classification
AmmieQi/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
AmmieQi/NExT-QA
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)
AmmieQi/PUM
[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
AmmieQi/rainbow-memory
Official pytorch implementation of Rainbow Memory (CVPR 2021)
AmmieQi/RPIN
Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)
AmmieQi/RT-VQA
Realtime Video Question Answering
AmmieQi/SAGENet_demo
Demo code for SAGE-Net
AmmieQi/SAR
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
AmmieQi/SOBERT-XVQA-demo
Visual Question Answering with attention map explanations using the SOBERT-VQA model
AmmieQi/Temporal-Context-Aggregation-Network-Pytorch
CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement
AmmieQi/text2image
Text to Image Generation with Semantic-Spatial Aware GAN
AmmieQi/VGT
Video Graph Transformer for Video Question Answering (ECCV'22)
AmmieQi/video-qa-FAAAN
AmmieQi/videoqa_dataset_visualization
Load and visualize different datasets in video question answering