BurntSushi/aho-corasick

Tracking issue for full-unicode case-insensitive matching

dhardy opened this issue · 1 comments

The docs for ascii_case_insensitive state:

NOTE: In the future, support for full Unicode case insensitivity may be added, but ASCII case insensitivity is comparatively much simpler to add.

I don't currently need this feature but it could be nice to have, hence this tracking issue.

For those interested in a work-around, see this and this.

Aye. I may wind up closing this because I'm not quite convinced this is worth adding because of the implementation complexity. If you need something like this, it might make more sense to just use regex-automata.

The note in the docs is there to acknowledge the absence of Unicode handling, and it being missing was not an oversight. Perhaps I should rephrase the docs.