visual-instruction-tuning
There are 12 repositories under the visual-instruction-tuning topic.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
CircleRadon/Osprey
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
ictnlp/LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that efficiently supports the understanding of images, high-resolution images, and videos.
zjysteven/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v, and more.
BAAI-DCAI/DataOptim
A collection of visual instruction tuning datasets.
ChenDelong1999/polite-flamingo
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
fraction-ai/GAP
Gamified Adversarial Prompting (GAP): crowdsourcing AI-weakness-targeting data through gamification. Boosts model performance with community-driven, strategic data collection.
bigai-nlco/VideoTGB
[EMNLP 2024] A Video Chat Agent with Temporal Prior
hllj/Vistral-V
Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.
zjr2000/REVERIE
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
jingyi0000/Awesome-Visual-Instruction-Tuning
Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey
yueying-teng/generate-language-image-instruction-following-data
Mistral-assisted visual instruction data generation, following the LLaVA approach.
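Pipelines like this one emit training samples in the LLaVA-style instruction-following format: each record pairs an image with a multi-turn conversation, where the human turn carries an `<image>` placeholder token. A minimal sketch of building one such record (the field names follow LLaVA's released data format; the image path, question, and answer here are illustrative placeholders):

```python
import json

def make_record(rec_id, image_path, question, answer):
    """Build one LLaVA-style visual instruction record (a sketch;
    field names follow LLaVA's released training-data format)."""
    return {
        "id": rec_id,
        "image": image_path,  # hypothetical path, relative to the image folder
        "conversations": [
            # The <image> token marks where visual features are inserted.
            {"from": "human", "value": "<image>\n" + question},
            {"from": "gpt", "value": answer},
        ],
    }

sample = make_record(
    "sample-0001",
    "images/example.jpg",
    "What is shown in this image?",
    "A model-generated description of the scene.",
)

# Trainers typically consume a JSON list of such records.
with open("instruct_data.json", "w") as f:
    json.dump([sample], f, indent=2)
```

A generation pipeline would fill the `gpt` turn with an assistant LLM's output (Mistral here, GPT-4 in the original LLaVA recipe) conditioned on image captions or box annotations rather than the raw pixels.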