Vietnamese Word Tokenizer This is a fork of the code from http://mim.hus.vnu.edu.vn/dsl/tools/tokenizer