This is a Python package to work with Chinese characters, also calle Hanzi (汉字). Currently implemented is the following:
- Dictionary lookup via CC-CEDICT
- Working with Remembering Simplified Hanzi
from hanzitools import Heisig
heisig = Heisig()
# Look up by number
heisig.lookup_number(1755) # Entry(character='杨', keyword='poplar', number=1755, also_known_as=[], parts=['tree', 'piglets'])
# Look up by keyword
heisig.lookup_keyword("ocean") # Entry(character='洋', keyword='ocean', number=530, also_known_as=[], parts=['water', 'sheep'])
# Look up by character
heisig.lookup_character("力") # HeisigEntry(character='力', keyword='power', is_primitive=False, number=732, also_known_as=['muscle', 'power'], parts=[])
from hanzitools import Cedict
cedict = Cedict()
cedict.lookup_simplified("约")
cedict.lookup_traditional("翰")
Translate Chinese characters to pinyin. This is currently very basic. For a better library, maybe use mozillazg/python-pinyin.
from hanzitools import pinyinify
pinyinify("转换汉字为拼音。")
pinyinify("你好!你今天吃饭了没?")
The Heisig data is owned by J. W. Heisig & T. W. Richardson, Honolulu: University of Hawai’i Press. We use the repository from rouseabout/heisig which provides a XML database of the Heisig data (MIT License). We use the CC-CEDICT as our go-to dictionary (CC BY-SA 3.0).