/SCPapers

Must-read papers on Sememe Computation

MIT LicenseMIT

Must-read papers on Sememe Computation

Contributed by Fanchao Qi.

Introduction

Sememes are defined as the minimum indivisible semantic units of meaning. Some people believe that semantic meanings of all the concepts including words can be composed of a limited closed set of sememes.

As a result, sememes can help us comprehend human languages better. In fact, some works have proved that some neural NLP models perform better with incorporation of sememe knowledge.

HowNet, initially published in 2000, is the most famous sememe-based linguistic knowledge base.

Papers

Introduction to HowNet

  1. Theoretical Findings of HowNet. Zhendong Dong, Qiang Dong and Changling Hao. JCIP 2007. [pdf (Chinese)]

This paper gives a summary of theoretical findings about knowledge of HowNet.

  1. HowNet - a hybrid language and knowledge resource. Zhendong Dong and Qiang Dong. NLP-KE 2003. [pdf]

This paper gives a brief introduction to HowNet.

  1. Introduction to HowNet. Zhendong Dong and Qiang Dong. [pdf (English)] [pdf (Chinese)]

This paper gives an overall introduction to HowNet, including its features, philosophy and constructing method.

  1. KDML — 知网知识系统描述语言. Zhendong Dong and Qiang Dong. [pdf (Chinese)]

This paper gives a detailed introduction to Knowledge Database Mark-up Language, the mark-up language used in HowNet.

Expansion of sememe-based knowledge bases

  1. Cross-lingual Lexical Sememe Prediction. Fanchao Qi, Yankai Lin, Maosong Sun, Hao Zhu, Ruobing Xie and Zhiyuan Liu. EMNLP 2018. [pdf] [codes]

  2. Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions. Wei Li, Xuancheng Ren, Damai Dai, Yunfang Wu, Houfeng Wang and Xu Sun. arXiv 2018. [pdf] [codes]

  3. Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model. Wei-Yun Ma and Yueh-Yin Shih. LREC 2018. [pdf]

  4. Incorporating Chinese Characters of Words for Lexical Sememe Prediction. Huiming Jin, Hao Zhu, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Fen Lin and Leyu Lin. ACL 2018. [pdf] [codes]

  5. Lexical Sememe Prediction via Word Embeddings and Matrix Factorization. Ruobing Xie, Xingchi Yuan, Zhiyuan Liu and Maosong Sun. IJCAI 2017. [pdf] [codes]

  6. E-HowNet and Automatic Construction of a Lexical Ontology. Wei-Te Chen, Su-Chu Lin, Shu-Ling Huang, You-Shan Chung and Keh-Jiann Chen. COLING 2010. [pdf]

  7. Extended-HowNet- A Representational Framework for Concepts. Keh-Jiann Chen, Shu-Ling Huang, Yueh-Yin Shih and Yi-Jun Chen. OntoLex 2005. [pdf]

Applications

  1. Language Modeling with Sparse Product of Sememe Experts. Yihong Gu, Jun Yan, Hao Zhu, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Fen Lin and Leyu Lin. EMNLP 2018. [pdf] [codes]

  2. Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention. Xiangkai Zeng, Cheng Yang, Cunchao Tu, Zhiyuan Liu and Maosong Sun. AAAI 2018. [pdf] [codes]

  3. Evaluating Semantic Rationality of a Sentence: A Sememe-Word-Matching Neural Network based on HowNet. Shu Liu, Jingjing Xu, Xuancheng Ren and Xu Sun. arXiv 2018. [pdf]

  4. Improved Word Representation Learning with Sememes. Yilin Niu, Ruobing Xie, Zhiyuan Liu and Maosong Sun. ACL 2017. [pdf] [codes]

  5. Embedding for Words and Word Senses Based on Human Annotated Knowledge Base: A Case Study on HowNet. Maosong Sun and Xinxiong Chen. JCIP 2016. [pdf (Chinese)]

  6. Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon. Xianghua Fu, Guo Li, Yanyan Guo and Zhiqiang Wang. Knowledge-Based Systems 2013. [pdf]

  7. Method of discriminant for Chinese sentence sentiment orientation based on HowNet. Lei Dang and Lei Zhang. Application Research of Computers 2010. [pdf (Chinese)]

  8. Semantic orientation computing based on HowNet. Yanlan Zhu, Jin Min, Yaqian Zhou, Xuanjing Huang and Lide Wu. JCIP 2005. [pdf (Chinese)]

  9. Chinese word sense disambiguation using HowNet. Yuntao Zhang, Ling Gong and Yongcheng Wang. ICNC 2005. [pdf]

  10. Word Similarity Computing Based on HowNet. Qun Liu and Sujian Li. International Journal of Computational Linguistics & Chinese Language Processing 2002. [pdf (Chinese)]