Some characters seem to get missed
Closed this issue · 1 comments
danielat998 commented
When running, many characters seem to be getting missed, in both the GUI and the command line. For example, the string "我是人" yields only '我' and '人'. '是' somehow seems to get lost in the mix.
danielat998 commented
I never fully got to the bottom of this, but the debugging I did suggests that it had something to do with Unicode character ordering. I have rewritten the code to use a HashMap, which had been discussed before and which I think is a better design decision anyway, particularly when it comes to re-adding support for traditional characters, and possibly searching by pinyin/English.
Side note: This is something that would have been picked up much sooner if we had written some unit tests...
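For anyone reading later, here is a rough sketch of what a HashMap-based lookup could look like. It is not the project's actual code: the class name `CharacterLookup`, the methods `add` and `lookupAll`, and the three sample entries are all made up for illustration. The idea is that each character is looked up by key, so the result no longer depends on how the entries happen to be ordered.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical class, for illustration only (not the project's real API).
class CharacterLookup {
    // Maps a single character (as a String) to its gloss.
    private final Map<String, String> entries = new HashMap<>();

    public void add(String character, String gloss) {
        entries.put(character, gloss);
    }

    // Returns every character of the input that has a dictionary entry,
    // in the order it appears in the input.
    public List<String> lookupAll(String input) {
        List<String> found = new ArrayList<>();
        // Iterate by code point so characters outside the BMP are not split.
        input.codePoints().forEach(cp -> {
            String key = new String(Character.toChars(cp));
            if (entries.containsKey(key)) {
                found.add(key);
            }
        });
        return found;
    }

    public static void main(String[] args) {
        CharacterLookup lookup = new CharacterLookup();
        // Sample entries, assumed for this example.
        lookup.add("我", "I; me");
        lookup.add("是", "to be");
        lookup.add("人", "person");
        System.out.println(lookup.lookupAll("我是人"));
    }
}
```

A unit test for the case reported above would then just assert that `lookupAll("我是人")` returns all three characters, which is the kind of check that would have caught the missing '是' early.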