Some CN glyphs are showing inappropriate JP or TW/HK forms for GB 18030
Marcus98T opened this issue · 0 comments
This is the Serif counterpart to these two issues on Sans, one for TW/HK and one for JP.
Yes, Source Han Serif fulfils the GB 18030 character set (and once again, they do not define glyph standards), but the glyph shape for some characters does not follow the PRC glyph conventions. Obviously, due to limited glyph space, we will never be able to get all GB 18030 characters to follow the PRC glyph conventions, and those affected characters are actually more important to Japanese, Taiwanese and Hong Kong standards than to Mainland Chinese standards.
As Dr. Ken Lunde said,
And yes, I am fully aware that some single-source ideographs have multiple region-specific glyphs. When push comes to shove, which was necessary for the Version 2.000 update, such glyphs are removed in order to make room for higher-priority glyphs.
Also, I'd like to point out that following standards such as GB/T 22321.1-2018 is not within the scope of the Source Han projects. I think of such standards as attempts to hammer square pegs into round holes, meaning that regional conventions are applied to ideographs that are not actually used in that particular region. It would be nice to do, but when dealing with a glyph set that is already full, practicality becomes necessary.
Originally posted by @kenlunde in adobe-fonts/source-han-sans#204 (comment)
Thankfully, there are less glyphs that do not follow PRC glyph conventions in Serif than in Sans, but still, I think this issue can be solved by merging non-essential regional differences as mentioned in this issue.
This is for reference only. It is unlikely that any action will be taken by Adobe, but still, please do not close this issue. The reason why a similar issue was closed was because the OP raised the issue for only two characters, but this one is much more comprehensive.
Also please see this issue with regards to restoring essential glyphs required for CN use (which does not cover all of GB 18030, only the most basic standards that are required).
These lists will only cover the Basic Multilingual Plane, so basically CJK Unified Ideographs and CJK Extension A only, which is what GB 18030 is about. Any characters in the Supplementary Ideographic Plane (aka CJK Ext. B and higher) will not be accounted for. I might provide a pictorial accompaniment in a future edit.
CN glyphs showing JP forms
These lists are based on removed v1 CN glyphs, sorted according to the Japanese JIS standard levels. Which means that I may not catch every non-compliant GB 18030 character which never had a CN glyph.
This time, I will only note whether the characters are non-compliant with the PRC glyph conventions, but will not suggest to restore them as they are subjective. It’s up to Adobe to decide if they want to restore them.
If I mention "regional design issue", it means that while technically the glyph may seem to follow PRC glyph conventions, Japanese conventions may deviate slightly from the PRC conventions, for example, by deciding whether certain vertical strokes must connect to the other components or not, and even then, the stroke-touching may probably be inconsistent across different JP-designed characters.
JIS Level 1
Unicode | Character | Non-compliant? |
---|---|---|
U+4E0D | 不 | |
U+4E98 | 亘 | |
U+5024 | 値 | Yes |
U+5141 | 允 | |
U+5177 | 具 | |
U+51CB | 凋 | |
U+51F1 | 凱 | Yes |
U+5351 | 卑 | |
U+535A | 博 | |
U+559C | 喜 | |
U+5618 | 嘘 | |
U+565B | 噛 | Yes |
U+57E0 | 埠 | |
U+5835 | 堵 | |
U+583A | 堺 | |
U+5C0F | 小 | |
U+5F6B | 彫 | |
U+5FA9 | 復 | |
U+5FB3 | 徳 | Yes |
U+5FC5 | 必 | |
U+601D | 思 | |
U+6075 | 恵 | |
U+6089 | 悉 | |
U+60AA | 悪 | |
U+60B6 | 悶 | |
U+60D1 | 惑 | |
U+60E3 | 惣 | |
U+611B | 愛 | |
U+6163 | 慣 | |
U+616E | 慮 | |
U+6182 | 憂 | |
U+6301 | 持 | |
U+635C | 捜 | |
U+6973 | 楳 | |
U+6E20 | 渠 | Yes |
U+732A | 猪 | |
U+754C | 界 | |
U+7570 | 異 | |
U+7A0B | 程 | |
U+7A42 | 穂 | |
U+7A4F | 穏 | |
U+8133 | 脳 | Yes |
U+819A | 膚 | |
U+81D3 | 臓 | Yes |
U+8449 | 葉 | Yes |
U+8511 | 蔑 | |
U+85AC | 薬 | |
U+865C | 虜 | |
U+8CB0 | 貰 | Yes |
U+8D77 | 起 | |
U+961C | 阜 | |
U+99C6 | 駆 | Yes |
JIS Level 2
Unicode | Character | Non-compliant? |
---|---|---|
U+51A4 | 冤 | |
U+53A6 | 厦 | |
U+5632 | 嘲 | |
U+568F | 嚏 | |
U+5F99 | 徙 | |
U+5FDD | 忝 | |
U+606A | 恪 | |
U+613C | 愼 | Yes |
U+6187 | 憇 | |
U+6191 | 憑 | |
U+6199 | 憙 | |
U+62FF | 拿 | |
U+63A3 | 掣 | |
U+6641 | 晁 | |
U+6763 | 杣 | Yes |
U+67B3 | 枳 | |
U+69C7 | 槇 | Yes |
U+69DD | 槝 | Yes |
U+6DE6 | 淦 | |
U+6F91 | 澑 | Yes |
U+6FFE | 濾 | |
U+7018 | 瀘 | |
U+701F | 瀟 | Yes |
U+70FD | 烽 | |
U+7162 | 煢 | |
U+7199 | 熙 | Yes |
U+7210 | 爐 | |
U+732F | 猯 | Yes |
U+73F1 | 珱 | Yes |
U+76BA | 皺 | Yes |
U+76E7 | 盧 | |
U+771E | 眞 | Yes |
U+77B9 | 瞹 | |
U+7C2B | 簫 | Yes |
U+7CAB | 粫 | |
U+8085 | 肅 | Yes |
U+81DA | 臚 | |
U+83B5 | 莵 | |
U+856D | 蕭 | Yes |
U+8606 | 蘆 | |
U+8655 | 處 | |
U+876E | 蝮 | |
U+89AC | 覬 | Yes |
U+8D05 | 贅 | |
U+8DFF | 跿 | |
U+8F0C | 輌 | Yes |
U+9112 | 鄒 | Yes |
U+941A | 鐚 | Yes |
U+958A | 閊 | Yes |
U+984B | 顋 | |
U+9871 | 顱 | |
U+9A65 | 驥 | |
U+9C04 | 鰄 | Yes |
U+9C08 | 鰈 | Yes |
U+9DC6 | 鷆 | Yes |
JIS Level 3
Unicode | Character | Non-compliant? |
---|---|---|
U+34B5 | 㒵 | |
U+5307 | 匇 | |
U+5861 | 塡 | Yes |
U+6018 | 怘 | |
U+6DFC | 淼 | |
U+7681 | 皁 | |
U+784F | 硏 | Yes |
U+7C01 | 簁 | |
U+8168 | 腨 | Yes |
U+865B | 虛 | Yes |
U+8F2D | 輭 | |
U+95D3 | 闓 | Yes |
U+9853 | 顓 | Yes |
U+9856 | 顖 | |
U+9857 | 顗 | Yes |
U+985A | 顚 | Yes |
U+9B9E | 鮞 | |
U+9BCE | 鯎 | Yes |
U+9C50 | 鱐 | Yes |
JIS Level 4
Unicode | Character | Non-compliant? |
---|---|---|
U+3775 | 㝵 | Regional design issue |
U+3DC0 | 㷀 | Regional design issue |
U+4E40 | 乀 | Yes |
U+5010 | 倐 | Yes |
U+5342 | 卂 | Regional design issue |
U+57F8 | 埸 | |
U+5827 | 堧 | |
U+5D42 | 嵂 | Yes |
U+5FC4 | 忄 | |
U+5FE2 | 忢 | |
U+611E | 愞 | |
U+6197 | 憗 | |
U+61DF | 懟 | |
U+63D4 | 揔 | |
U+682D | 栭 | |
U+69D6 | 槖 | |
U+6C35 | 氵 | |
U+6C3A | 氺 | |
U+6EB4 | 溴 | |
U+7000 | 瀀 | |
U+7065 | 灥 | |
U+7153 | 煓 | Yes |
U+71DC | 燜 | |
U+789D | 碝 | |
U+7BC5 | 篅 | Yes |
U+7BD6 | 篖 | Yes |
U+800E | 耎 | |
U+8011 | 耑 | Yes |
U+81EC | 臬 | Regional design issue |
U+8279 | 艹 | |
U+84F0 | 蓰 | |
U+84FD | 蓽 | Regional design issue |
U+8586 | 薆 | |
U+899F | 覟 | |
U+8F00 | 輀 | |
U+97A2 | 鞢 | Yes |
U+97F1 | 韱 | Regional design issue |
U+9C6B | 鱫 | |
U+9D11 | 鴑 | Yes |
U+9D76 | 鵶 | Yes |
U+9DB5 | 鶵 | Yes |
Other Non-JIS Characters
Most of them are part of Adobe-Japan1-6.
Also, when I checked the removed CN glyph lists, some of them ended up showing TW/HK glyphs in v2 instead of JP, which I have edited out and moved them to the below section.
Unicode | Character | Non-compliant? |
---|---|---|
U+2E8C | ⺌ | |
U+322B | ㈫ | |
U+323E | ㈾ | Yes |
U+3240 | ㉀ | Regional design issue |
U+32AE | ㊮ | Yes |
U+3689 | 㚉 | Yes |
U+3935 | 㤵 | |
U+3D35 | 㴵 | |
U+3E8A | 㺊 | |
U+4576 | 䕶 | Yes |
U+4E41 | 乁 | Regional design issue |
U+4E44 | 乄 | Regional design issue |
U+4E97 | 亗 | Yes |
U+4EFE | 仾 | Yes |
U+4F95 | 侕 | |
U+5044 | 偄 | |
U+524E | 剎 | Yes |
U+528C | 劌 | |
U+52F4 | 勴 | |
U+5365 | 卥 | |
U+53D0 | 叐 | Regional design issue |
U+5422 | 吢 | |
U+57CA | 埊 | Yes |
U+596F | 奯 | |
U+5B5E | 孞 | |
U+5D9A | 嶚 | Yes |
U+5DC8 | 巈 | Yes |
U+5FC8 | 忈 | |
U+5FCE | 忎 | |
U+5FD0 | 忐 | |
U+5FE5 | 忥 | |
U+6023 | 怣 | |
U+6038 | 怸 | |
U+6056 | 恖 | |
U+60A1 | 悡 | |
U+60C0 | 惀 | |
U+60C4 | 惄 | |
U+60C9 | 惉 | |
U+60CE | 惎 | |
U+60E2 | 惢 | |
U+6102 | 愂 | |
U+6117 | 愗 | |
U+6133 | 愳 | |
U+617F | 慿 | |
U+6895 | 梕 | |
U+689E | 梞 | |
U+6950 | 楐 | |
U+6A37 | 樷 | Yes |
U+6A4A | 橊 | Yes |
U+6A85 | 檅 | |
U+6C56 | 汖 | Yes |
U+6D0F | 洏 | |
U+6DB6 | 涶 | |
U+6DF4 | 淴 | |
U+6F7B | 潻 | Yes |
U+6FCF | 濏 | Yes |
U+703B | 瀻 | |
U+7157 | 煗 | |
U+719A | 熚 | Yes |
U+7208 | 爈 | |
U+7217 | 爗 | Yes |
U+724E | 牎 | |
U+7293 | 犓 | Yes |
U+7343 | 獃 | Yes |
U+7369 | 獩 | |
U+7379 | 獹 | |
U+74C7 | 瓇 | |
U+74D0 | 瓐 | |
U+7575 | 畵 | Yes |
U+76C1 | 盁 | Yes |
U+7796 | 瞖 | Yes |
U+77B8 | 瞸 | Yes |
U+77D1 | 矑 | |
U+782A | 砪 | Yes |
U+7878 | 硸 | Yes |
U+787E | 硾 | Yes |
U+788B | 碋 | Yes |
U+7916 | 礖 | |
U+7F73 | 罳 | |
U+8009 | 耉 | Yes |
U+800F | 耏 | |
U+80F7 | 胷 | Yes |
U+80F9 | 胹 | |
U+815D | 腝 | |
U+81D4 | 臔 | Yes |
U+8419 | 萙 | |
U+8550 | 蕐 | Yes |
U+8659 | 虙 | |
U+8745 | 蝅 | Yes |
U+8761 | 蝡 | |
U+89A8 | 覨 | Yes |
U+8D72 | 赲 | |
U+8D82 | 趂 | |
U+8D88 | 趈 | |
U+8DE5 | 跥 | Yes |
U+8E5D | 蹝 | Regional design issue |
U+90C9 | 郉 | Yes |
U+9193 | 醓 | |
U+921F | 鈟 | Yes |
U+936F | 鍯 | |
U+9402 | 鐂 | Yes |
U+9459 | 鑙 | |
U+967E | 陾 | |
U+967F | 陿 | Yes |
U+9691 | 隑 | Yes |
U+98C5 | 飅 | Yes |
U+9A51 | 驑 | Yes |
U+9B94 | 鮔 | Yes |
U+9BD0 | 鯐 | Regional design issue |
U+9D2F | 鴯 | |
U+9DEB | 鷫 | Yes |
CN glyphs showing TW/HK forms
Unicode | Character | Non-compliant? | Notes | Use HK glyph (if cannot restore JP/CN glyph)? |
---|---|---|---|---|
U+42A2 | 䊢 | CN glyph removed in v2 and a similar looking HK glyph is in its place | ||
U+6287 | 抇 | Yes | There was no JP or CN glyph in v1 | Yes (only for CN) |
U+6423 | 搣 | Yes | JP glyph removed in v2 | |
U+683A | 栺 | Yes | JP glyph removed in v2 | |
U+6B76 | 歶 | Yes | JP glyph removed in v2 | Yes (also applies to JP and KR) |
U+6E2A | 渪 | Yes | JP glyph removed in v2 | |
U+6E61 | 湡 | Yes | JP glyph removed in v2 | |
U+6F7F | 潿 | Yes | CN glyph removed in v2 | |
U+6FBF | 澿 | Yes | JP glyph removed in v2 | Already using it |
U+715F | 煟 | Yes | JP glyph removed in v2 | Already using it |
U+717C | 煼 | Yes | JP glyph removed in v2, however, there was no CN glyph in v1 | |
U+7182 | 熂 | Yes | JP glyph removed in v2 | Already using it |
U+71E4 | 燤 | Yes | JP glyph removed in v2 | |
U+724A | 牊 | Yes | JP glyph removed in v2 | |
U+72C5 | 狅 | Yes | JP glyph removed in v2, which could have been used for HK | Yes (also applies to JP and KR) |
U+735B | 獛 | Yes | JP glyph removed in v2 | |
U+77A8 | 瞨 | Yes | JP glyph removed in v2 | |
U+77CE | 矎 | Yes | JP glyph removed in v2 | |
U+78F1 | 磱 | Yes | JP glyph removed in v2 | |
U+78FD | 磽 | |||
U+7AFC | 竼 | Yes | JP glyph removed in v2 | |
U+7B01 | 笁 | Yes | JP glyph removed in v2 | |
U+7B1A | 笚 | Yes | JP glyph removed in v2 | |
U+7B1D | 笝 | Yes | JP glyph removed in v2 | |
U+7B44 | 筄 | Yes | JP glyph removed in v2 | |
U+7B63 | 筣 | Yes | JP glyph removed in v2 | |
U+7B78 | 筸 | Yes | JP glyph removed in v2 | |
U+7B82 | 箂 | Yes | JP glyph removed in v2 | |
U+7BA3 | 箣 | Yes | JP glyph removed in v2 | |
U+7BD8 | 篘 | Yes | CN glyph removed in v2 | |
U+7BE2 | 篢 | Yes | JP glyph removed in v2 | |
U+7BFB | 篻 | Yes | JP glyph removed in v2 | Yes (also applies to JP and KR) |
U+7BFD | 篽 | Yes | JP glyph removed in v2 | |
U+7BFF | 篿 | Yes | JP glyph removed in v2 | |
U+7C0A | 簊 | Yes | JP glyph removed in v2 | |
U+7C1D | 簝 | Yes | JP glyph removed in v2 | |
U+7C35 | 簵 | Yes | JP glyph removed in v2 | |
U+7C8E | 粎 | Yes | JP glyph removed in v2 | |
U+7CD0 | 糐 | Yes | JP glyph removed in v2 | Already using it |
U+8012 | 耒 | Using JP variant glyph uni8012uE0101-JP for CN | ||
U+81AB | 膫 | Yes | JP glyph removed in v2 | |
U+8215 | 舕 | Yes | JP glyph removed in v2 | Yes (also applies to JP and KR) |
U+8381 | 莁 | Yes | JP glyph removed in v2 | |
U+83C4 | 菄 | Yes | JP glyph removed in v2 | |
U+83CB | 菋 | Yes | JP glyph removed in v2 | |
U+8E3F | 踿 | Yes | JP glyph removed in v2 | |
U+9FA6 | 龦 | Yes | No Unicode reference for CN | |
U+9FA7 | 龧 | No Unicode reference for CN | ||
U+9FA8 | 龨 | Yes | No Unicode reference for CN, also see this issue to revert to the v1 glyph | |
U+9FA9 | 龩 | Yes | No Unicode reference for CN | |
U+9FAA | 龪 | No Unicode reference for CN | ||
U+9FAB | 龫 | No Unicode reference for CN | ||
U+9FAC | 龬 | Yes | No Unicode reference for CN | |
U+9FAD | 龭 | No Unicode reference for CN | ||
U+9FAE | 龮 | Yes | No Unicode reference for CN | |
U+9FAF | 龯 | No Unicode reference for CN | ||
U+9FB0 | 龰 | No Unicode reference for CN | ||
U+9FB1 | 龱 | No Unicode reference for CN | ||
U+9FB2 | 龲 | No Unicode reference for CN | ||
U+9FB3 | 龳 | Yes | No Unicode reference for CN | |
U+9FC7 | 鿇 | CN glyph removed in v2, but HK glyph looks identical | ||
U+9FC8 | 鿈 | Yes | CN glyph removed in v2 | |
U+9FC9 | 鿉 | CN glyph removed in v2 and a similar looking HK glyph is in its place | ||
U+9FCA | 鿊 | Yes | CN glyph removed in v2 | |
U+9FCB | 鿋 | CN glyph removed in v2 and a similar looking HK glyph is in its place | ||
U+9FD0 | 鿐 | Yes | CN glyph removed in v2 |