shabados/gurmukhi-utils

Add Unicode Nukta Character combinations to toAscii

sarabveer opened this issue · 0 comments

Describe the bug

Unicode allows two ways to type nukta characters.

First is the defined code point, (U+0A36) which is a single character. The other method is to add a nukta char (U+0A3C) to an existing char, such as ਸ਼ (U+0A38 + U+0A3C)

Since nutka is mapped to æ, the conversion results in instead of S

Expected behavior
A clear and concise description of what you expected to happen.

ਸ਼ (U+0A38 + U+0A3C) => S
...and so on for other Pair Bindi chars