Chinese datasets used in CIKM 2018 paper "Short Text Entity Linking with Fine-grained Topics"

file format:

  1. text.txt: text file
  2. annot.txt: annotation file, json
  3. pred.txt: predicate file, for HQA dataset, predicates (separated by "|") for the subject of each line

If you want to cite our work, please use this publication:
Lihan Chen, Jiaqing Liang, Chenhao Xie, Yanghua Xiao. 2018. Short Text Entity Linking with Fine-grained Topics. In The 27th ACM International Conference on Information and Knowledge Management (CIKM 18), Oct. 22-26, 2018, Torino, Italy. ACM, New York, NY, USA, 10 pages. s. https://doi.org/10.1145/3269206.3271809