
Must-read Papers on Sememe Computation

A sememe is defined as the minimum semantic unit in linguistics. Some linguists believe that the meanings of all words can be composed from a limited set of sememes.

Sememes can help us comprehend human languages better. Some studies have shown that neural NLP models benefit from the incorporation of sememes.

HowNet is the best-known sememe-based knowledge base. It predefines a set of about 2,000 sememes and uses them to annotate over 100,000 Chinese and English words.
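
To make the idea of sememe annotation concrete, here is a minimal sketch of how a word could map to senses, each described by a small set of sememes. The lexicon contents and the names `TOY_LEXICON` and `sememes_of` are hypothetical and for illustration only; they are not actual HowNet entries or part of any HowNet API.

```python
from typing import Dict, List, Set

# Hypothetical toy annotations for illustration only; NOT actual HowNet entries.
# Each word maps to a list of senses, and each sense is a set of sememes.
TOY_LEXICON: Dict[str, List[Set[str]]] = {
    "apple": [{"fruit"}, {"computer", "brand"}],
    "pear": [{"fruit"}],
}


def sememes_of(word: str, lexicon: Dict[str, List[Set[str]]] = TOY_LEXICON) -> Set[str]:
    """Return the union of sememes over all senses of `word`.

    Unknown words yield an empty set.
    """
    senses = lexicon.get(word, [])
    return set().union(*senses)
```

For example, `sememes_of("apple")` collects the sememes of both toy senses, while a word missing from the lexicon returns an empty set. A real knowledge base such as HowNet organizes sememes in richer structures than flat sets, so this sketch only conveys the basic word-to-sememe mapping.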

OpenHowNet, developed by THUNLP, open-sources the core data of HowNet and provides convenient data access APIs.



Expansion of Sememe Knowledge Bases

