ArthurZucker/minbpe
Minimal, clean, educational code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
PythonMIT
Watchers
No one’s watching this repository yet.
Minimal, clean, educational code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
PythonMIT
No one’s watching this repository yet.