An open source implementation of CLIP.
Robust fine-tuning of zero-shot models
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models