boatbomber/Highlighter

Lexer matches unicode, but doesn't catch it.

Exunys opened this issue · 2 comments

The title says it all. I am using src/lexer/init.lua. Here's a quick example:

⭮⽥⻶ⓞ⠏Ⱈ⵼⇠⺀❜⍨ↇ⡧⚏ⷔ⸂␳⋡⒀⻒⪻ⅆ⿜⌰⤟ⵣ⏗⸂⚿▚⪪⿐⽻⥝⁹⚎Ⓥ◥◶℀₏Ⅻ will get matched, but not caught properly.

print("⭮⽥⻶ⓞ⠏Ⱈ⵼⇠⺀❜⍨ↇ⡧⚏ⷔ⸂␳⋡⒀⻒⪻ⅆ⿜⌰⤟ⵣ⏗⸂⚿▚⪪⿐⽻⥝⁹⚎Ⓥ◥◶℀₏Ⅻ")

Will catch: "⭮⽥⻶ⓞ⠏Ⱈ⵼⇠⺀❜⍨ↇ⡧⚏ⷔ⸂␳⋡⒀⻒⪻ⅆ⿜⌰⤟ⵣ⏗⸂⚿▚⪪⿐⽻⥝⁹⚎Ⓥ◥◶℀₏Ⅻ")

It returns the Unicode with the suffix and ), but it's not supposed to.

Here's what it looks like:

image

Here is what it should look like:

image

Thanks for bringing this up. In fact, it was not an issue with the Unicode matcher but actually with the string matcher. A fix will be pushed shortly.
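For illustration only, here is a minimal sketch of the expected behaviour: a string token should end at its closing quote, so the trailing ) is left for the next token. The helper name matchString is hypothetical and is not taken from src/lexer/init.lua; it is not the actual fix, just one way to scan a double-quoted literal byte by byte. Scanning bytes is safe here because UTF-8 multi-byte sequences never contain the quote or backslash bytes.

```lua
-- Hypothetical helper (not part of Highlighter): returns the double-quoted
-- string literal that starts at byte position `pos`, or nil if there is none.
local function matchString(source, pos)
	if source:sub(pos, pos) ~= '"' then
		return nil
	end
	local i = pos + 1
	while i <= #source do
		local ch = source:sub(i, i)
		if ch == "\\" then
			i = i + 2 -- skip the backslash and the byte it escapes
		elseif ch == '"' then
			return source:sub(pos, i) -- stop at the closing quote
		else
			i = i + 1 -- ordinary byte, including UTF-8 lead/continuation bytes
		end
	end
	return nil -- unterminated string
end

local line = 'print("⭮⽥⻶")'
local start = line:find('"', 1, true) -- plain find for the opening quote
local token = matchString(line, start)
local rest = line:sub(start + #token) -- whatever follows the string token
print(token, rest)
```

With this approach the token covers only the quoted text, and the closing parenthesis remains in rest for the lexer to classify separately.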

Fixed, thank you!