/test-corpora

Corpora in many languages for testing, evaluating, benchmarking, and training Unicode algorithms

Primary LanguageHTMLOtherNOASSERTION

Unicode Test Corpora

Corpora in many languages for testing, evaluating, benchmarking, and training Unicode algorithms

Copyright & Licenses

Copyright © 2023-2024 Unicode, Inc. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the United States and other countries.

The project is released under LICENSE.

A CLA is required to contribute to this project - please refer to the CONTRIBUTING.md file (or start a Pull Request) for more information.