nlpub/pymystem3

Lemmatization errors in verbs

olesar opened this issue · 0 comments

Here comes a (non-exhaustive) list of verb forms lemmatized incorrectly by Mystem(). The list was compiled as part of GramEval2020 evaluation survey of baseline tools. Based on UD-SynTagRus v.2.5.
FORM - verb form
LEMMA_MANUAL - lemma assigned by expert
LEMMA_UDPIPE - lemma given in UD-SynTagRus (and thus assigned by udpipe model)
LEMMA_MYSTEM3 - lemma assigned by pymystem3

lemmas_wrong_choice.txt