visual-reasoning
There are 32 repositories under the visual-reasoning topic.
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
MILVLG/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
ethanjperez/film
FiLM: Visual Reasoning with a General Conditioning Layer
floodsung/Deep-Reasoning-Papers
Recent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep learning and reasoning
WellyZhang/RAVEN
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
sdc17/UPop
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
shijx12/XNM-Net
PyTorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs"
NVlabs/Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
NVlabs/RelViT
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
cobanov/image-captioning
Image captioning using Python and BLIP
hughplay/TVR
💥 Transformation Driven Visual Reasoning - CVPR 2021
bezorro/ACMN-Pytorch
Visual Question Reasoning on General Dependency Tree
WellyZhang/CoPINet
Learning Perceptual Inference by Contrasting
sdc17/CrossGET
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
WellyZhang/PrAE
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
catalina17/VideoNavQA
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
hughplay/Visual-Reasoning-Papers
📄 A curated list of visual reasoning papers.
WellyZhang/ACRE
ACRE: Abstract Causal REasoning Beyond Covariation
jaleedkhan/neusire
NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment
WellyZhang/ALANS
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
aelnouby/Relational-Networks
PyTorch implementation of the paper "A simple neural network module for relational reasoning", also known as Relation Networks, for visual reasoning.
marialymperaiou/knowledge-enhanced-multimodal-learning
A list of research papers on knowledge-enhanced multimodal learning
Sina-Baharlou/VisualGenome-to-Depth
Convert RGB images of Visual-Genome dataset to Depth Maps.
wentaoheunnc/HCV-ARR
[AAAI 2023] Hierarchical ConViT with Attention-based Relational Reasoner for Visual Analogical Reasoning
alexmirrington/honours-thesis
LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
jaehyunnn/RelationalNetwork_pytorch
An unofficial implementation of Relational Network [A. Santoro et al., 2017] (PyTorch)
rs9000/VisualReasoning_MMnet
Visual reasoning modular memory network
alexmirrington/gat-vqa
Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
markvasin/MSc-Project
Multimodal Learning and Reasoning for Visual Question Answering
markvasin/openvqa
Implementation of the VQA model from my MSc project
markvasin/nscl_reproducability_challenge
Reproducibility Challenge - The Neuro-Symbolic Concept Learner