A straightforward implementation of the CLIP model with detailed comments, for educational purposes :-)
Primary Language: Python