Markdown documents saved with different encodings that contain specific character tables and text examples. Use them to test encoding/decoding algorithms. Each document is also saved in generally accepted UTF-8 encoding so you can test conversion by loading both versions and comparing resulting strings.
- KOI8-R
- KOI8-RU
- KOI8-T
- KOI8-U
- Windows-1251
- Windows-1252
- more to add...
Additional binary file contains all possible bytes from 0
to 255
to test loading file as binary string.
- http://clagnut.com/blog/2380
- https://ru.wikipedia.org/wiki/%D0%9A%D0%9E%D0%98-8
- https://en.wikipedia.org/wiki/Tajik_alphabet
- https://be.wikipedia.org/wiki/%D0%9F%D0%B0%D0%BD%D0%B3%D1%80%D0%B0%D0%BC%D0%B0
- https://ru.wikipedia.org/wiki/Windows-1251
- https://scratchpad.fandom.com/wiki/Character_Encoding_Recommendation_for_Languages
- https://ru.wikipedia.org/wiki/ISO_8859-1
Feel free to add samples for uncovered encodings and suggest improvements and fixes to existing samples.