My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
Primary Language: Python
License: MIT