gqa
There are 14 repositories under the gqa topic.
bknyaz/sgg
Train scene graph generation models for Visual Genome and GQA in PyTorch >= 1.2 with improved zero- and few-shot generalization.
Bruce-Lee-LY/decoding_attention
Decoding Attention is specially optimized for MHA, MQA, GQA, and MLA using CUDA cores for the decoding stage of LLM inference.
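For context, the decode stage that such kernels target computes attention for a single new token against the cached keys and values. The sketch below is plain PyTorch with illustrative names, not the repository's CUDA kernel, and shows that step for standard MHA.

    import torch
    import torch.nn.functional as F

    def decode_step_attention(q_new, k_cache, v_cache):
        # q_new:   (batch, heads, 1, head_dim)         query for the newly generated token
        # k_cache: (batch, heads, past_len, head_dim)  keys accumulated so far
        # v_cache: (batch, heads, past_len, head_dim)  values accumulated so far
        scale = q_new.shape[-1] ** -0.5
        scores = (q_new @ k_cache.transpose(-2, -1)) * scale  # (batch, heads, 1, past_len)
        attn = F.softmax(scores, dim=-1)
        return attn @ v_cache                                  # (batch, heads, 1, head_dim)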
phiyodr/vqaloader
PyTorch DataLoader for many VQA datasets
DigitalPhonetics/Intrinsic-Subgraph-Generation-for-VQA
Predicting a subgraph alongside the answer in a graph-based VQA model
haukzero/from-mha-to-mla
Principles and minimal implementations of MHA, MQA, GQA, and MLA
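As a reference point for these variants: grouped-query attention shares a small number of key/value heads across groups of query heads (MHA uses one KV head per query head, MQA a single KV head for all). A minimal PyTorch sketch, not taken from the repository:

    import torch
    import torch.nn.functional as F

    def grouped_query_attention(q, k, v):
        # q:    (batch, num_q_heads, seq, head_dim)
        # k, v: (batch, num_kv_heads, seq, head_dim), num_q_heads a multiple of num_kv_heads
        group = q.shape[1] // k.shape[1]
        # each KV head is shared by `group` consecutive query heads
        k = k.repeat_interleave(group, dim=1)
        v = v.repeat_interleave(group, dim=1)
        scores = (q @ k.transpose(-2, -1)) * q.shape[-1] ** -0.5
        return F.softmax(scores, dim=-1) @ v

With num_kv_heads equal to num_q_heads this reduces to MHA, and with a single KV head it is MQA.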
leaderj1001/Vision-Language
Vision-language model for solving the GQA (Visual Reasoning in the Real World) dataset.
ycchen218/VisionQA-Llama2-OWLViT
A multimodal model designed for the Visual Question Answering (VQA) task. It integrates the Llama2 13B, OWL-ViT, and YOLOv8 models.
ExplainableML/ZS-A2T
[GCPR 2023] Zero-shot Translation of Attention Patterns in VQA Models to Natural Language
sahasourav17/IntelliAnswer
A RAG-based question-answering system that answers user queries from local documents, extracting relevant information and falling back to a large language model when local sources are insufficient.
alexmirrington/honours-thesis
LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
alexmirrington/gat-vqa
Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
eltoto1219/vltk
A toolkit for vision-language processing to support the increasing popularity of multi-modal transformer-based models
NMPoole/CS5014-MLVisualAttributes
Case study of multi-layer perceptron and random forest techniques as applied to a subset of the GQA dataset.
sushantkumar23/baby-llama
A simple Llama-architecture LLM in PyTorch