EamonnCarson/suffix
Fast suffix arrays for Rust (with Unicode support). Modified to support the operations described in "Deduplicating Training Data Makes Language Models Better" arXiv:2107.06499v2
RustUnlicense
Watchers
No one’s watching this repository yet.