/suffix

Fast suffix arrays for Rust (with Unicode support). Modified to support the operations described in "Deduplicating Training Data Makes Language Models Better" arXiv:2107.06499v2

Primary LanguageRustThe UnlicenseUnlicense

Watchers

No one’s watching this repository yet.