Embed arbitrary modalities (images, audio, documents, etc) into large language models.
Primary LanguagePythonApache License 2.0Apache-2.0