/suffix

Fast suffix arrays for Rust (with Unicode support). Modified to support the operations described in "Deduplicating Training Data Makes Language Models Better" (arXiv:2107.06499v2).
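
For readers new to the data structure, here is a minimal sketch of how a suffix array can surface repeated substrings, which is the kind of exact-duplicate detection the cited paper relies on. It does not use this crate's API: the function names `naive_suffix_array`, `common_prefix_len`, and `repeated_spans` are hypothetical, and the construction is a deliberately naive sort of suffix offsets rather than the fast algorithm this repository provides.

```rust
/// Build a suffix array over the bytes of `text` by sorting suffix start
/// offsets lexicographically. Naive and slow on large inputs; illustration only.
fn naive_suffix_array(text: &[u8]) -> Vec<usize> {
    let mut sa: Vec<usize> = (0..text.len()).collect();
    sa.sort_by(|&a, &b| text[a..].cmp(&text[b..]));
    sa
}

/// Length of the longest common prefix of two byte slices.
fn common_prefix_len(a: &[u8], b: &[u8]) -> usize {
    a.iter().zip(b.iter()).take_while(|(x, y)| x == y).count()
}

/// For each pair of suffixes adjacent in the suffix array that share at least
/// `min_len` leading bytes, return (offset_a, offset_b, shared_len).
/// Hypothetical helper, not part of this crate.
fn repeated_spans(text: &[u8], min_len: usize) -> Vec<(usize, usize, usize)> {
    let sa = naive_suffix_array(text);
    sa.windows(2)
        .filter_map(|w| {
            let len = common_prefix_len(&text[w[0]..], &text[w[1]..]);
            if len >= min_len {
                Some((w[0], w[1], len))
            } else {
                None
            }
        })
        .collect()
}
```

Calling `repeated_spans(b"banana", 3)` reports the repeated `ana` at offsets 3 and 1. Because sorting places suffixes with a shared prefix next to each other, comparing neighbours is enough to find candidate duplicates. This sketch works on raw bytes, so offsets are byte offsets, not Unicode character positions.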

Primary language: Rust. License: The Unlicense.
