vqa-dataset
There are 38 repositories under the vqa-dataset topic.
vztu/BVQA_Benchmark
A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
abachaa/VQA-Med-2019
Visual Question Answering in the Medical Domain (VQA-Med 2019)
Cloud-CV/VQA
CloudCV Visual Question Answering Demo
sutdcv/SUTD-TrafficQA
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
findalexli/SciGraphQA
SciGraphQA: Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs
vzhou842/easy-VQA
The Easy Visual Question Answering dataset.
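The easy-VQA data also ships as a pip package; a rough usage sketch, with function names taken from the project's README as best recalled (treat them as assumptions and verify against the repo):

```python
# Rough usage sketch of the easy-vqa pip package (API per the project's
# README as recalled; verify against the repo before relying on it).
from easy_vqa import get_train_questions, get_train_image_paths, get_answers

questions, answers, image_ids = get_train_questions()  # parallel lists
image_paths = get_train_image_paths()                   # {image_id: path}
answer_vocab = get_answers()                            # all possible answers

print(questions[0], "->", answers[0])                   # a shape/color question
print(image_paths[image_ids[0]])                        # path to its image
```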
CAMMA-public/SSG-VQA
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.
badripatro/awesome-vqg
Visual Question Generation reading list
Letian2003/C-VQA
Counterfactual Reasoning VQA Dataset
abachaa/VQA-Med-2021
VQA-Med 2021
yanx27/CLEVR3D
CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
google-research-datasets/maverics
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).
yousefkotp/Visual-Question-Answering
A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach for the VizWiz Grand Challenge 2023, by carefully curating the answer vocabulary and adding a linear layer on top of OpenAI's CLIP model as the image and text encoder
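A minimal sketch of the non-generative idea described above: encode the image and question with CLIP, concatenate the embeddings, and classify over a fixed, curated answer vocabulary. The class name and the use of the Hugging Face transformers CLIP wrapper are illustrative assumptions, not this repo's actual code.

```python
# Minimal sketch (not this repo's code): CLIP encoders plus a linear
# classification head over a fixed answer vocabulary, in PyTorch.
import torch
import torch.nn as nn
from transformers import CLIPModel  # assumed dependency

class ClipVqaClassifier(nn.Module):
    def __init__(self, num_answers: int, clip_name: str = "openai/clip-vit-base-patch32"):
        super().__init__()
        self.clip = CLIPModel.from_pretrained(clip_name)
        dim = self.clip.config.projection_dim  # shared image/text embedding size
        # Non-generative head: predict one answer from a curated vocabulary.
        self.head = nn.Linear(2 * dim, num_answers)

    def forward(self, pixel_values, input_ids, attention_mask):
        img = self.clip.get_image_features(pixel_values=pixel_values)
        txt = self.clip.get_text_features(input_ids=input_ids,
                                          attention_mask=attention_mask)
        return self.head(torch.cat([img, txt], dim=-1))  # answer logits
```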
csebuetnlp/IllusionVQA
This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
chakravarthi589/Video-Question-Answering_Resources
Video Question Answering | Video QA | VQA
lisamalani/VLR_term_project
Multi-page document understanding and VQA using an OCR-free method
fraction-ai/GAP
Gamified Adversarial Prompting (GAP): Crowdsourcing AI-weakness-targeting data through gamification. Boost model performance with community-driven, strategic data collection
ghazaleh-mahmoodi/lxmert_compression
B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.
VibhuJawa/vqa-2018
This repo implements attention networks for visual question answering
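As a reference point for what "attention networks" means here, below is a minimal question-guided attention step over image region features; tensor shapes and names are illustrative assumptions, not this repo's implementation.

```python
# Sketch of question-guided attention over image region features
# (illustrative only; not VibhuJawa/vqa-2018's actual code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class QuestionGuidedAttention(nn.Module):
    def __init__(self, img_dim: int, q_dim: int, hidden: int = 512):
        super().__init__()
        self.proj_img = nn.Linear(img_dim, hidden)
        self.proj_q = nn.Linear(q_dim, hidden)
        self.score = nn.Linear(hidden, 1)

    def forward(self, img_feats, q_feat):
        # img_feats: (batch, regions, img_dim); q_feat: (batch, q_dim)
        joint = torch.tanh(self.proj_img(img_feats) +
                           self.proj_q(q_feat).unsqueeze(1))
        attn = F.softmax(self.score(joint), dim=1)   # one weight per region
        return (attn * img_feats).sum(dim=1)         # attended image vector
```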
gutbash/lmm-graph-vision
How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?
manoja328/vqatools
API for the VQA and Visual7W datasets
zeryabmoussaoui/Real-time-VQA
A real-time Visual Question Answering Framework
IAmS4n/Visual-Question-Answering
An investigation of the VQA dataset. TensorFlow implementation of a solution based on CNN and RNN architectures, plus ideas such as attention and positional features.
juletx/egunean-behin-vqa
Egunean Behin Visual Question Answering Dataset
radonys/CFB-VQA
VQA Challenge - hosted on Hasura using Flask
rentainhe/TRAR-Feature-Extraction
Grid features extraction for ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
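For context, grid features are typically taken from a CNN backbone's final convolutional map rather than from region proposals. A sketch with a torchvision ResNet, under the assumption of a standard ResNet-50 extractor (TRAR's actual pipeline may differ):

```python
# Sketch of grid-feature extraction from a ResNet-50 backbone
# (illustrative; not necessarily TRAR's actual extractor).
import torch
from torchvision.models import resnet50

backbone = resnet50(weights="IMAGENET1K_V2")
extractor = torch.nn.Sequential(*list(backbone.children())[:-2])  # drop pool+fc
extractor.eval()

with torch.no_grad():
    img = torch.randn(1, 3, 448, 448)       # preprocessed image batch
    grid = extractor(img)                   # (1, 2048, 14, 14) feature map
    grid = grid.flatten(2).transpose(1, 2)  # (1, 196, 2048) grid tokens
```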
abdur75648/MedicalGPT
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
shivam1423/VQA
Visual Question Answering (VQA) software powered by Flask; the project combines images and questions to generate answers for interactive visual understanding.
thatAverageGuy/EarlyFusion-on-EasyVQA
Streamlit app demonstrating multi-modal (vision + language) modelling in PyTorch; see the early-fusion sketch below.
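Early fusion means concatenating the vision and language features before any joint processing. A minimal PyTorch sketch, with names and dimensions assumed for illustration:

```python
# Minimal early-fusion sketch in PyTorch (an assumed illustration of the
# approach, not this repo's code): fuse first, then classify.
import torch
import torch.nn as nn

class EarlyFusionVQA(nn.Module):
    def __init__(self, img_dim: int, q_dim: int, num_answers: int):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(img_dim + q_dim, 256), nn.ReLU(),
            nn.Linear(256, num_answers),
        )

    def forward(self, img_feat, q_feat):
        # Fuse *before* any joint modelling: one concatenated representation.
        return self.mlp(torch.cat([img_feat, q_feat], dim=-1))
```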
dinesh-kumar-mr/MediVQA
Part of our final-year project, involving complex NLP tasks and experiments on various datasets and different LLMs
MuhammadShavaiz/DL-Visual-Question-Answering
A Visual Question Answering (VQA) model with a simple GUI that handles both images and videos. It uses OpenAI's CLIP to encode images and questions and GPT-2 to decode the embeddings into answers, built on the VQA v2 dataset (265,016 images, each with multiple questions and answers).
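A rough sketch of the encoder-decoder wiring that description implies: project the CLIP image embedding into GPT-2's embedding space and prepend it as a prefix to the question tokens. This mirrors common CLIP-to-GPT-2 prefix approaches and is an assumption about the repo's design, not its actual code.

```python
# Sketch of a CLIP-encoder -> GPT-2-decoder VQA model (an assumed design
# in the spirit of the description above, not this repo's actual code).
import torch
import torch.nn as nn
from transformers import CLIPModel, GPT2LMHeadModel

class ClipGpt2Vqa(nn.Module):
    def __init__(self):
        super().__init__()
        self.clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
        self.gpt2 = GPT2LMHeadModel.from_pretrained("gpt2")
        # Map the CLIP embedding into GPT-2's hidden size as a 1-token prefix.
        self.prefix = nn.Linear(self.clip.config.projection_dim,
                                self.gpt2.config.n_embd)

    def forward(self, pixel_values, input_ids):
        img = self.clip.get_image_features(pixel_values=pixel_values)
        prefix = self.prefix(img).unsqueeze(1)      # (B, 1, n_embd)
        tok = self.gpt2.transformer.wte(input_ids)  # question token embeddings
        inputs_embeds = torch.cat([prefix, tok], dim=1)
        return self.gpt2(inputs_embeds=inputs_embeds).logits
```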