Clarification about -e argument (and possible feature request)

Question

Clarification about -e argument (and possible feature request)

Closed this issue 3 years ago · 1 comments

According to the docs, -e is the "Max Levenshtein distance to allow during fuzzy matching. Default: 10 total insertions, deletions and substitutions."

Does this "total" mean across all the parsed parts of a name (including genus, specific epithet, author, etc), or is it applied to each part (so the default would be a total of 10 for genus plus 10 for specific epithet, etc)?

If the former, it may be useful to implement a way to allow different values of -e to apply to each part of the name. It is likely that author names will have more variation than genus or specific epithet due to inconsistencies in abbreviation.

Answer 1 · 2021-10-14T14:44:07.000Z

The -e argument is the total distance across the whole name string (now documented). Unfortunately it would require a complete rewrite to allow different distances for different name parts. So while this is a good suggestion, I'll close this issue.