vlms
There are 27 repositories under the vlms topic.
NanoNets/docext
An on-premises, OCR-free toolkit for unstructured data extraction, markdown conversion, and benchmarking. (https://idp-leaderboard.org/)
yueliu1999/Awesome-Jailbreak-on-LLMs
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel jailbreak methods for LLMs. It contains papers, code, datasets, evaluations, and analyses.
dvlab-research/VisionZip
Official repository for VisionZip (CVPR 2025)
tianyi-lab/HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Beckschen/ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
MCG-NJU/AWT
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
aim-uofa/SegAgent
[CVPR 2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
foundation-multimodal-models/CAL
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
mbzuai-oryx/KITAB-Bench
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
video-db/ocr-benchmark
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
Mamadou-Keita/VLM-DETECT
[ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
FSoft-AI4Code/VisualCoder
[NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning
ThomasVonWu/Awesome-VLMs-Strawberry
A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.
Imageomics/VLM4Bio
Code for VLM4Bio, a benchmark dataset of scientific question-answer pairs used to evaluate pretrained VLMs for trait discovery from biological images.
logic-OT/BobVLM
BobVLM – A 1.5B-parameter multimodal model built from scratch and pre-trained on a single P100 GPU, capable of image description and moderate question answering. 🤗🎉
PGSmall/clip-pgs
Official code for "Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection" (CVPR 2025)
ShenzheZhu/JailDAM
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
hucebot/words2contact
Official implementation of "Words2Contact: Identifying Support Contacts from Verbal Instructions Using Foundation Models" (IEEE-RAS Humanoids 2024).
Raymond-Qiancx/Awesome-Multimodal-Machine-Learning-Papers
A taxonomy and listing of influential recent studies in advanced multimodal machine learning.
SrGrace/generative-ai-compass
A comprehensive guide to navigating the world of generative artificial intelligence!
VectorInstitute/VLDBench
VLDBench: A large-scale benchmark for evaluating Vision-Language Models (VLMs) and Large Language Models (LLMs) on multimodal disinformation detection.
werywjw/MultiClimate
[EMNLP 2024 Workshop NLP4PI]🌏 MultiClimate: Multimodal Stance Detection on Climate Change Videos 🌎
yasho191/SwiftAnnotate
Auto-labeling tool for text, image, and video data
angmavrogiannis/Embodied-Attribute-Detection
Code for the ICRA 2025 paper: Discovering Object Attributes by Prompting Large Language Models with Perception-Action APIs
KT313/assistant_base
A custom framework for easy use of LLMs, VLMs, etc., supporting various modes and settings via a web UI
khurramHashmi/LLaVA-v1.6-Mistral-7b-Finetune-ORPO-RLAIF-V
Align llava-v1.6-mistral-7b on the RLAIF-V dataset using ORPO
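The entry above aligns a LLaVA model with ORPO (Odds Ratio Preference Optimization, Hong et al., 2024). As a minimal sketch of the idea, not code from the repo: ORPO adds an odds-ratio preference term to the standard SFT loss, pushing the policy's odds of the chosen response above those of the rejected one. The function below computes that term from average per-token log-probabilities; the inputs and names are illustrative assumptions.

```python
import math

def orpo_odds_ratio_loss(logp_chosen: float, logp_rejected: float) -> float:
    """Odds-ratio preference term from ORPO (illustrative sketch).

    logp_chosen / logp_rejected are the policy model's average per-token
    log-probabilities of the chosen and rejected responses. With
    odds(p) = p / (1 - p), the loss is -log sigmoid(log-odds ratio).
    """
    def log_odds(logp: float) -> float:
        p = math.exp(logp)
        return logp - math.log1p(-p)  # log(p / (1 - p))

    ratio = log_odds(logp_chosen) - log_odds(logp_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-ratio)))  # -log sigmoid(ratio)
```

In ORPO this term is added, with a weighting coefficient, to the usual cross-entropy loss on the chosen response, so no separate reference model is needed (unlike DPO).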