seperate_words function based on \W+ re instead?
Closed this issue · 5 comments
jkterry1 commented
@fabianvf Is it my imagination or could we replace the entire separate_words function better with a \W+ or \W regex instead?
fabianvf commented
Honestly I'm not sure, does it make a difference? Wonder if there are any edge cases related to punctuation or something that will bite us.
jkterry1 commented
It'll be important for nonwestern languages.
…On Aug 31, 2017 4:18 PM, "Fabian von Feilitzsch" ***@***.***> wrote:
Honestly I'm not sure, does it make a difference? Wonder if there are any
edge cases related to punctuation or something that will bite us.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#23 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AShd7CGb6Mvox5WWWtd2GjTIZhujfB7rks5sdxT8gaJpZM4OwesY>
.