vlms

There are 16 repositories under the vlms topic.

  • yueliu1999/Awesome-Jailbreak-on-LLMs

    Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

  • tianyi-lab/HallusionBench

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Language: Python
  • dvlab-research/VisionZip

    Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

    Language: Python
  • Beckschen/ViTamin

    [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

    Language: Python
  • MCG-NJU/AWT

    [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

    Language: Python
  • foundation-multimodal-models/CAL

    [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

    Language: Python
  • Mamadou-Keita/VLM-DETECT

    [ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection

    Language: Python
  • ThomasVonWu/Awesome-VLMs-Strawberry

    A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.

  • hucebot/words2contact

    Official implementation of "Words2Contact: Identifying Support Contacts from Verbal Instructions Using Foundation Models" (IEEE-RAS Humanoids 2024).

    Language: Python
  • Imageomics/VLM4Bio

    Code for VLM4Bio, a benchmark dataset of scientific question-answer pairs used to evaluate pretrained VLMs for trait discovery from biological images.

    Language: Python
  • angmavrogiannis/Embodied-Attribute-Detection

    Code Implementation for the paper: Discovering Object Attributes by Prompting Large Language Models with Perception-Action APIs

    Language: Python
  • KT313/assistant_base

    A custom framework for easy use of LLMs, VLMs, etc. supporting various modes and settings via web-ui

    Language: Jupyter Notebook
  • SrGrace/generative-ai-compass

    A comprehensive guide to navigating the world of generative artificial intelligence!

  • werywjw/MultiClimate

    [EMNLP 2024 Workshop NLP4PI]🌏 MultiClimate: Multimodal Stance Detection on Climate Change Videos 🌎

    Language: Jupyter Notebook
  • LiAo365/EPSR_VTG

    Language: Python
  • krishnaura45/MemeShield

    Proactive Content Moderation Using LLMs and VLMs

    Language: Python