Pinned Repositories
ALGCN
This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cross-Modal Retrieval".
Backdoor-Trigger-Detection
Benchmark and code for Backdoor Trigger Detection
FS-MEVQA
Authors' source for ACM MM 2024 paper "Few-Shot Multimodal Explaining for Visual Question Answering"
get-away-now
Get away from UCAS!!!
GNN4CMR
PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-22 paper "Integrating Multi-Label Contrastive Learning with Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval".
MMT
Authors' implementation of ACMMM2022 paper "MMT: Image-guided Story Ending Generation with Multimodal Memory Transformer".
PoisonCAM
Code for "Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking".
VCIN
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reasoning with Variational Causal Inference Network for Explanatory Visual Question Answering"
VCNLG
Vision-Controllable Natural Language Generation
Awesome-Interpretable-Cross-modal-Reasoning
A Survey on Interpretable Cross-modal Reasoning
LivXue's Repositories
LivXue/GNN4CMR
PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-22 paper "Integrating Multi-Label Contrastive Learning with Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval".
LivXue/get-away-now
Get away from UCAS!!!
LivXue/ALGCN
This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cross-Modal Retrieval".
LivXue/VCIN
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reasoning with Variational Causal Inference Network for Explanatory Visual Question Answering"
LivXue/MMT
Authors' implementation of ACMMM2022 paper "MMT: Image-guided Story Ending Generation with Multimodal Memory Transformer".
LivXue/FS-MEVQA
Authors' source for ACM MM 2024 paper "Few-Shot Multimodal Explaining for Visual Question Answering"
LivXue/VCNLG
Vision-Controllable Natural Language Generation
LivXue/PoisonCAM
Code for "Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking".
LivXue/Backdoor-Trigger-Detection
Benchmark and code for Backdoor Trigger Detection
LivXue/LININ
LININ: Logic Integrated Neural Inference Network for Explanatory Visual Question Answering
LivXue/MC-DPGMM
LivXue/NC3L