Issues
reward base model missing
#27 opened by Ritz111 - 1
How was the llava_ppo50k-aokvqa12k-vqa10k.json data constructed?
#34 opened by Spring24ch - 2
Question about the reward model's score
#35 opened by DripNowhy - 1
Question about the optimization time
#33 opened by JulioZhao97 - 2
Question About the reward model
#32 opened by tyxiong23 - 1
How to use the reward model in isolation?
#28 opened by jxgu1016 - 12
Model testing
#26 opened by ernestoBocini - 1
Image Data for RM
#30 opened by ChencongZJU - 1
NotImplementedError in rl_trainer.py
#25 opened by janak11111 - 1
About 'hallucination' in preference dataset
#23 opened by davidluciolu - 1
The accuracy of the reward model seems to be low
#24 opened by Wizardcoast - 4
The performance of the released ckpt is much lower than the scores reported in the paper
#20 opened by Weiyun1025 - 1
evaluation images missing?
#19 opened by findalexli - 2
Training on RTX 4090
#18 opened by luohaowen2003 - 1
Question about instruction data
#17 opened by zhang-jr - 13
Cannot reproduce results
#8 opened by Haoye17 - 6
Detailed Results of models on MMHal-Bench
#13 opened by vateye - 1
When will the training codes be released?
#6 opened by feymanpriv - 1
Will the RM be released?
#14 opened by findalexli - 13
RuntimeError: The size of tensor a (577) must match the size of tensor b (257) at non-singleton dimension 1
#11 opened by HarrySSH - 2
Merge the models
#9 opened by ThierryDeruyttere - 0
Can you use this with 4bit?
#7 opened by ThierryDeruyttere - 2
Images for the SFT dataset
#4 opened by yuvalkirstain - 7
Error when calling the model
#3 opened by LiqiangJing - 1
Great work! Can I know if there is any implementation or script to call this model? Thanks.
#1 opened by WilTay1 - 1
how to use the model for testing
#2 opened by LiqiangJing