Add Unicode Nukta Character combinations to toAscii
sarabveer opened this issue · 0 comments
sarabveer commented
Describe the bug
Unicode allows two ways to type nukta characters.
First is the defined code point, ਸ਼
(U+0A36) which is a single character. The other method is to add a nukta char ਼
(U+0A3C) to an existing char, such as ਸ਼
(U+0A38 + U+0A3C)
Since nutka is mapped to æ
, the conversion results in sæ
instead of S
Expected behavior
A clear and concise description of what you expected to happen.
ਸ਼
(U+0A38 + U+0A3C) => S
...and so on for other Pair Bindi chars