[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
Primary language: Python. License: Apache License 2.0 (Apache-2.0).