Issues
- 1
- 9
- 0
question about the Labeler selection
#39 opened by lyklly - 4
question about the training loss?
#38 opened by lufanma - 9
batch size=1 , only a Nvidia a100 memory=80, why display cuda out of memory?
#22 opened by AderonHuang - 3
Error when initing
#34 opened by pspdada - 0
Support for Retraining MiniCPM-V?
#35 opened by DarioPTWR - 3
vision_tower seem not load
#26 opened by zhangzef - 2
where to find the lora fine-tuned checkpoints?
#32 opened by lufanma - 2
Training bugs - UnboundLocalError: local variable 'df' referenced before assignment
#29 opened by youthHan - 2
The LoRA training codes and scripts
#11 opened by darkpromise98 - 3
Issues for Implementation
#30 opened by injadlu - 1
- 1
about the RefoMB evaluation experiment
#24 opened by Molly-3000 - 4
- 4
Questions Regarding the Training Data and Settings for LLaVA as Used in the Paper
#15 opened by Timsty1 - 1
Would the data generation code be released?
#21 opened by Gaffey - 2
the actual number of samples of the huggingface RLAIF-V-Dataset is 83k, not 30k?
#25 opened by Molly-3000 - 4
- 3
我想问下 数据中logps怎么来的
#16 opened by Spring24ch - 1
请问一下Iterative alignment是如何在代码中实现的呢?
#23 opened by HeLeHanPrivate - 1
divide and conquer
#18 opened by dreaming12580 - 1
Performance of POPE?
#17 opened by shufangxun - 2
- 3
- 2
- 1
请问下divide的模型是额外finetune过的么?
#12 opened by menglin0320 - 1
Error when loading datasets split
#13 opened by Xuchen-Li - 1
control the output threshold of a large model
#10 opened by zfr00 - 0
- 2
keyerror:idx,I have changed the data_dir,but when I run the train script,I occured the error.How to fix
#7 opened by XiaoLei2123 - 3
Error loading the parquet dataset
#5 opened by charismaticchiu - 3
- 2
- 1
dpo_preference_processor not defined
#3 opened by RifleZhang - 2
ref_win_logp
#1 opened by buptlihang