haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Python · Apache-2.0
Issues
[Usage] Error in loading mistral 7b weights
#1817 opened by bryanlinnan - 1
Fine-tuning the llava_1.5 model with finetune_task_lora.sh fails to run
#1811 opened by zhangye0402 - 0
[Discussion] Newbie alert - asked llava:13b the question below about guitar scales and it's giving wrong information.
#1816 opened by ginamdar - 0
Can llava-v1.5-7b accept images smaller than 336 directly, feeding them into the network without resizing?
#1815 opened by 924973292 - 1
[Question] Finetuned llava-v1.5-7b using only the full TextVQA data, but got Accuracy: 0.00%
#1789 opened by ethanyys - 0
[Usage] Cannot Launch a gradio web server
#1812 opened by LiuChang-ao - 0
Fine-tuning the llava_1.5 model with finetune_task_lora.sh fails to run
#1810 opened by zhangye0402 - 0
[Question] Abnormal Generation after finetuning ONLY single label classification dataset
#1796 opened by enkaranfiles - 1
[Usage] peft 1.40.0 requires transformers at least 0.47
#1794 opened by shtu-ryan - 1
[Question] Model parameters during finetuning (prints me only mm_projector parameters)
#1784 opened by daulettoibazar - 0
[Usage] Missing file 'model_vqa_qbench'
#1764 opened by eslambakr - 0
[Question] grad_norm=None
#1787 opened by Pixel-anter - 0
Issue with 4-bit Quantization for LLaVA-NeXT-Video-32B Model on A100-40GB GPU
#1791 opened by Rachel0901 - 1
[Question] When I reproduced the second stage using finetune.sh, the processing time per image was too slow
#1790 opened by TanmouTT - 2
[Usage] Infer on model finetuned using finetune_qlora.sh
#1771 opened by amagzari - 0
[Question] demo website error https://llava.hliu.cc/
#1786 opened by zkailinzhang - 0
[Question] LLaVA pretraining was interrupted; resuming pretraining raises an error when loading the model by default
#1785 opened by cqray1990 - 0
[Question] Does llava support dynamic image tokens input?
#1783 opened by hktk07 - 0
[Discussion] loss function of finetune llava 1.5 with sft
#1782 opened by bollossom - 0
[Question] Where can I download the OCR-VQA data?
#1781 opened by cqray1990 - 0
[Question] pretrain data
#1779 opened by cqray1990 - 0
[Question] I trained the llava-1.5-13B model, but when evaluating, the inference answer is always empty and the speed is very slow.
#1778 opened by ykzqjyhhh - 0
[Usage] json.decoder.JSONDecodeError: Expecting ',' delimiter: line 1559608 column 82 (char 39444904)
#1775 opened by tianke0711 - 0
[Question] Image-text match
#1774 opened by hazardout - 0
[Usage] The issue encountered when fine-tuning llava_mistral1.6 using LoRA
#1772 opened by yuwang4321 - 0
Where is convert_answer_to_mme.py?
#1769 opened by Tramac - 0
[Question] disable print
#1768 opened by LiXinYuann - 1
[Question] LLaVA 1.5 7B model fine-tune -- pydantic
#1765 opened by yiwei-chenn - 0
[Question] Pretrain preprocess
#1767 opened by leo-young - 0
[Question] Issue with trying to reproduce the results of LLaVA-Bench-in-the-Wild on LLaVA-v1.5-7b. Has anyone who's reproduced it got this error?
#1766 opened by cookiesupers22 - 0
[Usage] Error during evaluation for image.../image.png: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1 in method wrapper_CUDA_addmm)
#1762 opened by ChathurangiShyalika - 0
Fine tune for object detection
#1761 opened by ck-amrahd - 0
[Discussion] pip install e . error
#1760 opened by JavaWebT - 0
[Usage] Training process gets stuck in the last iteration of the instruction finetuning phase.
#1759 opened by fmy7834 - 1
[Question] Is it possible to extract the latent representation of the image input from the model?
#1758 opened by Tizzzzy - 0