/vlm-rlaif

ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers