timoschick/dino

question about baseline in table 1

Closed this issue · 2 comments

Hi!

Impressive results, many thanks for the code! I have a Q about Table 1, where it says "supervised SBERT baseline", but in the rest of the paper it only mentions SBERT trained on NLI and not tuning on STS train. I guess supervised means also it's fine-tuned on STS training, or do I err?
Thanks, Juri

Hi Juri, supervised training is performed using only NLI data. This is mentioned in the caption of Table 1 (highlight mine):

Table 1: Spearman’s rank correlation on STS12–16, STSb and SICK without finetuning on task-specific examples for models with NLI supervision (“sup.”) and fully unsupervised (“unsup.”) models [...]

Thanks! I also saw now that some other papers same terminology. I'd rather call that zero-shot or something, but I understand now.