Dump results

Question

forrestbao opened this issue 2 years ago · 1 comments

Additional metrics to test
- [high priority] BERTScore-sentence, MNLI, {RoBERTa, DeBERTA}, Entail - Contradict, top-k and top-p
- [medium] BERTScore-original, DeBERTa -- to see how much language models impact it
- ~~[low] BERTScore-sentence, cosine, DeBERTa -- again, to see the impact of language models~~
how to print result into Google Sheets
Done SigmaWe/EvalBase#4

Answer 1 · 2023-01-14T08:14:08.000Z

Please run BERTScore sentence, non-MNLI just simple cosine, but with sentence weighting when you have time. @TURX