High performance chinese tokenizer with both GBK and UTF-8 charset support developed by ANSI C
Primary LanguageCApache License 2.0Apache-2.0
No one’s watching this repository yet.