Taiyi-CLIP-Roberta-102M-Chinese finetuning error

Question

1306825592 opened this issue 5 months ago · 0 comments

In Fengshenbang-LM/fengshen/examples/clip_finetune/clip_finetune_flickr.py,
at the step that computes the similarity,
the printed shapes are as follows:


logits_per_image = (logit_scale * image_features @ text_features.t()).detach().cpu()
logit_scale is torch.Size([5000, 512]), image_features is torch.Size([5000, 2]), and label is 5000

The shapes are also inconsistent at this point.

Am I reproducing this incorrectly?
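
To make the mismatch concrete, here is a minimal sketch in plain Python (no PyTorch needed) of the shape rule that `image_features @ text_features.t()` has to satisfy. The helper `similarity_shape` is hypothetical and not part of Fengshenbang-LM, and it is only an assumption that the two printed shapes belong to the two feature tensors being multiplied.

```python
def similarity_shape(image_shape, text_shape):
    """Return the shape of image_features @ text_features.t() for 2-D inputs.

    Hypothetical helper for illustration only. Raises ValueError when the
    embedding dimensions differ, which is when the matrix product is undefined.
    """
    if len(image_shape) != 2 or len(text_shape) != 2:
        raise ValueError(
            f"expected 2-D features, got {image_shape} and {text_shape}"
        )
    n_images, image_dim = image_shape
    n_texts, text_dim = text_shape
    if image_dim != text_dim:
        raise ValueError(
            f"embedding dims differ: image {image_dim} vs text {text_dim}"
        )
    # One row per image, one column per text.
    return (n_images, n_texts)


# The two shapes printed in the report; which tensor each one belongs to
# is an assumption, not something the report confirms.
reported_a = (5000, 512)
reported_b = (5000, 2)

try:
    similarity_shape(reported_b, reported_a)
    mismatch = None
except ValueError as err:
    mismatch = str(err)

print("reported shapes:", mismatch)
print("consistent pair:", similarity_shape((5000, 512), (5000, 512)))
```

If the two feature tensors really had these shapes, the product could not be formed at all, so a trailing dimension of 2 suggests that one of the printed tensors is not the projected embedding.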