/Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers