/CJKReadings

Korean Readings in CJK Unified Ideographs Extension

MIT LicenseMIT

Objective

In Unicode, there are not Korean readings in CJK Unified Ideographs Extension, as known as Ext. Series, which contain Hanzi/Kanji/Hanja/Tu Han. By this reason, I added Korean readings with "Kangxi Dictionary" to both fanqie(反切) reading and Hanzi data. Also, "Unicode Hanja Information Dictionary of Variant Characters" provide to many Korean reading for the information to fill with it.

목적

유니코드에 등록된 한중일월 한자 중에 한중일 통합 한자 확장(CJK Unified Ideographs Extension) 영역의 한국어 음가가 없는 것을 보고, 강희자전(康煕字典)에 기반한 데이터에 준거해서 한자 반절(反切) 읽기 및 발음, 그리고 유니코드 한자 이체자 정보 사전을 통하여 한글 음가를 정리하였습니다.

目的

Unicodeに登録された日中韓越漢字から、日中韓統合漢字拡張(CJK Unified Ideographs Extension)領域のハングル表記が全くないことを見て、康煕字典に基づいてデータを築き、漢字の反切から発音をはかどりました。それに合わせてユニコード漢字異体字情報辞典からハングルの読みをまとめました。

Supported Glyphs

  • CJK Unified Ideographs Extension B (39,397 glyphs; 92.24% cover)
  • CJK Unified Ideographs Extension C (3,205 glyphs; 77.25% cover)
  • CJK Unified Ideographs Extension D (222 glyphs; All cover)
  • CJK Unified Ideographs BMP Add 69 Glyphs (Unicode 10.0)

History

  • 2018/2/19 - Update ExtB Reading; Add ExtC, ExtD, BMPAdd Reading
  • 2016/8/19 - Change current license into MIT License
  • 2016/5/18 - Add readings from Vietnamese, seperated CJK Extended Compatiblity file from ExtB.txt
  • 2016/5/7 - Add Korean readings derived from Chinese or Vietnamese
  • 2016/2/26 - Add 1,654 characters
  • 2016/2/25 - Add 28,762 characters (67.34% of all characters at that time)

License

MIT License

Contact

References