CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)
Primary language: Python. License: MIT.