High performance chinese tokenizer with both GBK and UTF-8 charset support developed by ANSI C
Primary LanguageCApache License 2.0Apache-2.0
No issues in this repository yet.