weighting of partial matches in the fuzzy evaluation scoring

Question

weighting of partial matches in the fuzzy evaluation scoring

aflueckiger opened this issue 4 years ago · 4 comments

@e-maud @mromanello
Do we reward partial matches as high as full matches (i.e. exact boundaries) in our fuzzy evaluation scoring? This would mean setting the formula the 0.5 to 1 in the formula below.

Source: http://www.davidsbatista.net/blog/2018/05/09/Named_Entity_Evaluation/

Answer 1 · 2020-01-30T16:02:10.000Z

My take on it is that for fuzzy evaluation we are really relaxed, meaning setting it to 1. But your mileage might vary.

Answer 2 · 2020-01-31T15:45:03.000Z

Idem, I would be very relaxed for fuzzy: one token overlap suffices to get full score, and very strict for exact: boundary should match exactly. So yes, 1 for partial matches in fuzzy setting.

Answer 3 · 2020-01-31T16:06:30.000Z

Fine by me as well!

Answer 4 · 2020-02-03T08:29:11.000Z

Ok, our implementation follows this proposal.