joddie/pcre2el

ICU regular expressions

Opened this issue · 0 comments

i'm a novice when it comes to non-elisp regexes, but i'm interested in converting ICU regexes to el, either via PCRE and then this package or directly.

does anyone around these parts know if there is much of a difference between PCRE and ICU? would it be plausible for me to modify this library to get a direct conversion working?

i was hoping to parse SRX sentence rules (http://okapiframework.org/wiki/index.php?title=SRX) for sophisticated sentence ending and non-ending rules in various languages, mainly for translation. The Okapi Foundation has a collection available free to use, but they are done as ICU regexes.