gqa

There are 14 repositories under gqa topic.

  • bknyaz/sgg

    Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization.

    Language:Jupyter Notebook13851120
  • Bruce-Lee-LY/decoding_attention

    Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.

    Language:C++43204
  • phiyodr/vqaloader

    PyTorch DataLoader for many VQA datasets

    Language:Python13101
  • DigitalPhonetics/Intrinsic-Subgraph-Generation-for-VQA

    Predicting a subgraph alongside the answer in a graph based VQA model

    Language:Python9121
  • haukzero/from-mha-to-mla

    MHA, MQA, GQA, MLA 相关原理及简要实现

    Language:Python8100
  • leaderj1001/Vision-Language

    Vision-Language, Solve GQA(Visual Reasoning in the Real World) dataset.

    Language:Python5131
  • ycchen218/VisionQA-Llama2-OWLViT

    This is a multimodal model design for the Vision Question Answering (VQA) task. It integrates the Llama2 13B, OWL-ViT, and YOLOv8 models.

    Language:Python4200
  • ExplainableML/ZS-A2T

    [GCPR 2023] Zero-shot Translation of Attention Patterns in VQA Models to Natural Language

  • sahasourav17/IntelliAnswer

    A RAG-based question-answering system that processes user queries using local documents. It extracts relevant information to answer questions, falling back to a large language model when local sources are insufficient, ensuring accurate and contextual responses.

    Language:Python3100
  • alexmirrington/honours-thesis

    LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"

    Language:TeX2102
  • alexmirrington/gat-vqa

    Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"

    Language:Python11272
  • eltoto1219/vltk

    A toolkit for vision-language processing to support the increasing popularity of mulit-modal transformer-based models

    Language:HTML1201
  • NMPoole/CS5014-MLVisualAttributes

    Case study of multi-layer perceptron and random forest techniques as applied to a subset of the GQA dataset.

    Language:Python0100
  • sushantkumar23/baby-llama

    Simple Llama architecture LLM in pytorch

    Language:Jupyter Notebook10