yossigandelsman/clip_text_span

official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

Jupyter NotebookMIT

Issues

A question about the result of compute_ablations.py
#15 opened a month ago by LZY-233
2
Questions about the explanation of output concepts
#12 opened a month ago by RuoyuChen10
0
Segmentation fault when using compute segmentation.py
#13 opened a month ago by William-Chittavong
0
A question about the result of compute_ablations.py
#14 opened a month ago by LZY-233
0
About text_descriptions
#11 opened 5 months ago by dbsdmlgus50
2
A question on token decomposition
#10 opened 8 months ago by X-funbean
1
TEXTSPAN Algorithm
#9 opened 8 months ago by toffeecat
1
How to hook only last few layers?
#8 opened 9 months ago by tangli-udel
0
Did you compare the zero-shot segmentation performance between initial clip and your proposed image tokens decompositions ?
#6 opened 9 months ago by Yang-bug-star
1
Whether you evaluate the direct effect of different layers on zero-shot classification accuracy on the test set or on the same validation set used to calculate the mean
#4 opened 9 months ago by Yang-bug-star
4
How to unroll the direct effect to find a second-order effect and how to remove such second-order effect? Could you explain more on detail.
#7 opened 9 months ago by Yang-bug-star
1
Here, 'base' refers to randomly ablating 10 heads, or does it refer to the original OpenCLIP?
#5 opened 9 months ago by Yang-bug-star
1
Queries on Equation 5 and 6 Notations
#3 opened 9 months ago by chu7zpah
4
Queries on MultiheadAttention implementation
#2 opened a year ago by thapaliya19
1
arXiv PDF is not available
#1 opened a year ago by SakurajimaMaiii
2