/Polos

[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

Primary Language: Python
License: BSD 3-Clause Clear License (BSD-3-Clause-Clear)
