findalexli/mllm-dpo
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
Jupyter Notebook
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
Jupyter Notebook