Benchmarking `simple` and `spacylemma`
Opened this issue · 0 comments
AmitMY commented
It would be interesting to compare the performance of the two approaches on different languages.
We could use word lists (https://github.com/michmech/lemmatization-lists) which are context-free lemmas for words or we can use UD (https://universaldependencies.org/) to lemmatize in context.
Comparing the time and performance between simple
and spacylemma
per language can be informative, and direct us to using a default system on a per-language basis