VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024
Primary LanguagePythonApache License 2.0Apache-2.0