EamonnCarson/suffix
Fast suffix arrays for Rust (with Unicode support). Modified to support the operations described in "Deduplicating Training Data Makes Language Models Better" arXiv:2107.06499v2
RustUnlicense
Stargazers
No one’s star this repository yet.