optimaize/language-detector

Feature to weight prefix and suffix n-grams differently (higher)

Closed this issue · 1 comments

Affixes are important in detecting script-based languages like Latin. More important than what's in the middle of words.

Split borderFactor info prefixFactor and suffixFactor.