jfilter/clean-text

whitespace between emojis

cod3licious opened this issue · 1 comments

Thanks for this great library! :)

To facilitate tokenisation it would be great if additional whitespace could be added before and after each emoji.

I'm not sure if this can be easily done. There are emojis that consist of several sub-emojis. And I don't want to go into the direction of writing an emoji tokenizer just to add whitespace.

If people need some very special emoji handling, this other package may help: https://github.com/carpedm20/emoji

(If somebody comes up with an easy way to accomplish: I'm open to pull requests.)