loretoparisi opened this issue 10 months ago · 0 comments
Implement a "token-free" or tokenization free encoder to work at Unicode/UTF-8 character-level.
Examples