hipe-eval/HIPE-scorer

weighting of partial matches in the fuzzy evaluation scoring

aflueckiger opened this issue · 4 comments

@e-maud @mromanello
Do we reward partial matches as high as full matches (i.e. exact boundaries) in our fuzzy evaluation scoring? This would mean setting the formula the 0.5 to 1 in the formula below.

DeepinScreenshot_select-area_20200130162256

Source: http://www.davidsbatista.net/blog/2018/05/09/Named_Entity_Evaluation/

My take on it is that for fuzzy evaluation we are really relaxed, meaning setting it to 1. But your mileage might vary.

Idem, I would be very relaxed for fuzzy: one token overlap suffices to get full score, and very strict for exact: boundary should match exactly. So yes, 1 for partial matches in fuzzy setting.

Fine by me as well!

Ok, our implementation follows this proposal.