Unicode range validation test:
Closed this issue · 3 comments
Unicode range validation test:
All these ranges contain legitimate Chinese characters which may appear in the Etymology table. Following are test characters from each range.
一 CJK Unified Ideograms Unified Ideographs: (U+4E00 to U+9FFF)
㐦 CJK Ideographs Extension A: (U+3400 to U+4DBF)
𠀀 CJK Ideographs Extension B: (U+20000 to U+2A6DF)
𪜀 CJK Ideographs Extension C: (U+2A700 to U+2B73F)
𫝀 CJK Ideographs Extension D: (U+2B740 to U+2B81F)
“丽” CJK Comparability Ideographs Supplement: (U+2F800 to U+2FA1F)
⺀ CJK Radicals Supplement: (U+2E80 to U+2EFF)
⼀ Kangxi Radicals: (U+2F00 to U+2FDF)
“⿰” Ideographic Description Characters: (U+2FF0 to U+2FFF)
〥 CJK Symbols and Punctuation: (U+3000 to U+303F)
い Hiragana: (U+3040 to U+309F)
ア Katakana: (U+30A0 to U+30FF)
ㄆ Bopomofo: (U+3100 to U+312F)
ㆡ Bopomofo Extended: (U+31A0 to U+31BF)
㇏ CJK Strokes: (U+31C0 to U+31EF) Legitimize
ㇰ Katakana Phonetic Extensions: (U+31F0 to U+31FF)
豈 CJK Compatibility Ideographs: (U+F900 to U+FAFF)
︽ CJK Compatibility Forms: (U+FE30 to U+FE4F)
ャ Half width and Full width Forms: (U+FF00 to U+FFEF)
I do not have all of these test characters in all of these code ranges, I am just saying that each of these code ranges may have legitimate characters that are now or may in the future be in my table.