/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Primary LanguageRustApache License 2.0Apache-2.0

Pinned issues

Training a model from in-memory data

#198 opened by loicbarrault

Closed1

ByteLevelBPETokenizer output seems weird

#203 opened by seyyaw

Open2

Issues