Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Primary Language: Python