Text similarities are not in descending order

Question

Text similarities are not in descending order

Closed this issue a year ago · 1 comments

gunesevitan commented a year ago

I was debugging my way around and noticed something. I create image features like this

features = ci.image_to_features(image)

and retrieve top-k items

top_labels = ci.mediums.rank(features, 10, reverse=False)

When I check the similarities using
ci.similarities(features, top_labels)

I get values like this

[0.1334228515625, 0.1431884765625, 0.12371826171875, 0.1591796875, 0.267578125, 0.1026611328125, 0.140380859375, 0.1810302734375, 0.094482421875, 0.1348876953125]

Aren't those values supposed be in descending order since top_labels are selected with torch.topk?

Answer 1 · 2023-03-31T19:26:32.000Z

I finally understood why that happens. _rank method in LabelTable class normalizes text_features with all embeddings but similarities method in Interrogator class normalizes text_features only top-k selected embeddings. Discrepancy isn't a problem here so I'm closing this.