DAMO-NLP-SG/VCD
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
PythonApache-2.0
Issues
- 1
About greedy search in VCD
#18 opened by haohaodw - 9
- 2
Is the response of Qwen-VL very short?
#25 opened by zifuwan - 0
About seed
#30 opened by alex1243423 - 1
ValueError when initing
#28 opened by pspdada - 1
dtype
#27 opened by alex1243423 - 1
Ask for parameters
#26 opened by maming109 - 14
Inquiry Regarding Discrepancy in Results Using the Same Model and Methodology as Presented in Your Paper
#16 opened by obananas - 1
Inference Sample
#23 opened by Shinyzenith - 3
- 2
About the experiment results in the Table.1
#21 opened by tbbbk - 1
question about evaluation.
#17 opened by tt6746690 - 10
unable to reproducing the results of llava
#9 opened by frankRenlf - 0
VCD in LVLMs
#19 opened by sunzjz - 4
About the Experimental Setup
#15 opened by haohaodw - 0
Issues about importing llava
#14 opened by Melon-Xu - 11
- 3
model_kwargs_cd and model_kwargs problem
#7 opened by Mr-xiu - 1
might be a waste of resources
#12 opened by SDaoer - 1
- 1
unable to reproduce the results of Table 7
#11 opened by XueJiang16 - 4
- 0
If two confidence levels of original and distorted inputs are high and similar, can plausibility constraints make negative effect?
#6 opened by QiushiYang - 2
- 6
Decoding hyperparameter details
#4 opened by yuezih - 1
can you provide the image subsets you used for evaluation because the whole set of gqa and coco is so large
#3 opened by yfzhang114 - 2
- 8