Clarification about -e argument (and possible feature request)
According to the docs, -e is the "Max Levenshtein distance to allow during fuzzy matching. Default: 10 total insertions, deletions and substitutions."
Does this "total" mean across all the parsed parts of a name combined (genus, specific epithet, author, etc.), or is it applied to each part separately (so the default would allow up to 10 for genus, plus 10 for specific epithet, and so on)?
If the former, it may be useful to implement a way to allow different values of -e to apply to each part of the name. It is likely that author names will have more variation than genus or specific epithet due to inconsistencies in abbreviation.
The -e argument is the total distance across the whole name string (now documented). Unfortunately, allowing different distances for different name parts would require a complete rewrite. So while this is a good suggestion, I'll close this issue.
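
To illustrate the difference between the two interpretations discussed above, here is a minimal Python sketch. It is not this tool's implementation: the functions `levenshtein`, `whole_name_match` and `per_part_match`, the part names, and the per-part limits are all hypothetical and exist only for this example. It assumes a plain Levenshtein distance with unit costs for insertions, deletions and substitutions.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def whole_name_match(query: str, candidate: str, max_dist: int = 10) -> bool:
    """Hypothetical: one budget shared by the whole name string."""
    return levenshtein(query, candidate) <= max_dist


def per_part_match(query: dict, candidate: dict, limits: dict) -> bool:
    """Hypothetical: a separate budget for each parsed name part."""
    return all(
        levenshtein(query[part], candidate[part]) <= limit
        for part, limit in limits.items()
    )


# Illustrative names only; the parsing into parts is assumed, not computed.
query = {"genus": "Quercus", "epithet": "alba", "author": "Linnaeus"}
candidate = {"genus": "Quercus", "epithet": "alba", "author": "L."}

# Whole-string interpretation: every edit counts against a single total.
print(whole_name_match(" ".join(query.values()), " ".join(candidate.values())))

# Per-part interpretation: strict on genus and epithet, lenient on author.
print(per_part_match(query, candidate, {"genus": 1, "epithet": 1, "author": 8}))
```

Under the whole-string interpretation, edits spent on an abbreviated author name use up the same budget that would otherwise tolerate spelling differences in the genus or epithet, which is the motivation for the per-part suggestion.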