LijieFan/LaCLIP

Performance on Downstream Tasks

praeclarumjj3 opened this issue · 1 comments

Hi, thanks for the great work!

I wonder how much improvement LaCLIP demonstrates on downstream tasks like zero-shot image understanding (segmentation, detection, etc.), where the community has widely leveraged CLIP. Did you perform any experiments along these lines?

Thanks!

Hi, @praeclarumjj3,

We haven't tried zero-shot segmentation or detection, as it's out of the scope of this paper. But I am also wondering how it would perform.