Benchmarking `simple` and `spacylemma`

Question

Benchmarking `simple` and `spacylemma`

Opened this issue a year ago · 0 comments

It would be interesting to compare the performance of the two approaches on different languages.
We could use word lists (https://github.com/michmech/lemmatization-lists) which are context-free lemmas for words or we can use UD (https://universaldependencies.org/) to lemmatize in context.

Comparing the time and performance between simple and spacylemma per language can be informative, and direct us to using a default system on a per-language basis