Embeddable C++ Unicode library offering UTF-8 and UTF-16 encoding, general category info, simple lowercasing, uppercasing, and titlecasing, normalization forms, and stripping of combining marks.

UniLib: C++17 Embeddable Unicode Library

UniLib is an embeddable C++17 Unicode library released under the MPL 2.0 license. It is versioned using Semantic Versioning.

It currently offers:

  • UTF-8 and UTF-16 encodings
  • Unicode General Category info
  • simple and full lower/title/uppercasing
  • Unicode normalization forms
  • efficient stripping of combining marks
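
To give a feel for what the encoding and mark-stripping features involve, here is a minimal standalone C++17 sketch. It does not use UniLib and is not UniLib's API: the names `append_utf8`, `decode_utf8`, and `strip_basic_marks` are hypothetical, and the stripping helper assumes already-decomposed input and checks only the Combining Diacritical Marks block (U+0300 to U+036F) rather than real General Category data.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <string_view>

// Illustrative helpers only; these are NOT UniLib's API.

// Append the UTF-8 encoding of one Unicode scalar value to `out`.
inline void append_utf8(char32_t cp, std::string& out) {
  // Precondition: a valid scalar value (in range, not a surrogate).
  assert(cp <= 0x10FFFF && !(cp >= 0xD800 && cp <= 0xDFFF));
  if (cp < 0x80) {
    out += static_cast<char>(cp);
  } else if (cp < 0x800) {
    out += static_cast<char>(0xC0 | (cp >> 6));
    out += static_cast<char>(0x80 | (cp & 0x3F));
  } else if (cp < 0x10000) {
    out += static_cast<char>(0xE0 | (cp >> 12));
    out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
    out += static_cast<char>(0x80 | (cp & 0x3F));
  } else {
    out += static_cast<char>(0xF0 | (cp >> 18));
    out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
    out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
    out += static_cast<char>(0x80 | (cp & 0x3F));
  }
}

// Decode UTF-8 into code points. A malformed sequence (stray byte,
// truncated, overlong, surrogate, or out of range) becomes U+FFFD.
inline std::u32string decode_utf8(std::string_view in) {
  std::u32string out;
  std::size_t i = 0;
  while (i < in.size()) {
    const unsigned char lead = static_cast<unsigned char>(in[i++]);
    char32_t cp = 0;
    int extra = 0;        // number of continuation bytes expected
    char32_t lowest = 0;  // smallest value that legitimately needs this length
    if (lead < 0x80) {
      cp = lead;
    } else if ((lead & 0xE0) == 0xC0) {
      cp = lead & 0x1F; extra = 1; lowest = 0x80;
    } else if ((lead & 0xF0) == 0xE0) {
      cp = lead & 0x0F; extra = 2; lowest = 0x800;
    } else if ((lead & 0xF8) == 0xF0) {
      cp = lead & 0x07; extra = 3; lowest = 0x10000;
    } else {
      out += U'\uFFFD';  // stray continuation byte or invalid lead byte
      continue;
    }
    bool ok = true;
    for (; extra > 0; --extra) {
      if (i >= in.size() ||
          (static_cast<unsigned char>(in[i]) & 0xC0) != 0x80) {
        ok = false;
        break;
      }
      cp = (cp << 6) | (static_cast<unsigned char>(in[i++]) & 0x3F);
    }
    if (!ok || cp < lowest || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF))
      cp = 0xFFFD;
    out += cp;
  }
  return out;
}

// Drop code points in the Combining Diacritical Marks block only.
// A fuller implementation would consult General Category data instead.
inline std::u32string strip_basic_marks(const std::u32string& in) {
  std::u32string out;
  for (char32_t cp : in)
    if (cp < 0x0300 || cp > 0x036F) out += cp;
  return out;
}
```

For example, calling `strip_basic_marks(decode_utf8("e\xCC\x81"))` on a decomposed "é" (the letter `e` followed by U+0301) leaves just `e`. Precomposed characters such as U+00E9 would first need canonical decomposition, which this sketch does not attempt.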

It is tested on the following compilers and platforms:

  • Linux using gcc and clang
  • Windows using Visual C++ 2019 and newer
  • macOS using clang

Copyright 2014-2023 Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic.

The UniLib repository is hosted on GitHub at http://github.com/ufal/unilib.