/cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

Primary LanguagePythonCreative Commons Attribution Share Alike 4.0 InternationalCC-BY-SA-4.0

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC2018)

This repository contains the data for The Second Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2018). We will present our paper on EMNLP-IJCNLP 2019.

Title: A Span-Extraction Dataset for Chinese Machine Reading Comprehension
Authors: Yiming Cui, Ting Liu, Wanxiang Che, Li Xiao, Zhipeng Chen, Wentao Ma, Shijin Wang, Guoping Hu
Link: https://arxiv.org/abs/1810.07366
Venue: EMNLP-IJCNLP 2019

Open Challenge Invitation

The Second Evaluation Workshop on Chinese Machine Reading Comprehension was succesfully ended. The evaluation committee had decided to continue to accept submissions to further evaluations on the hidden test set and challenge set.

CMRC 2018 Public Datasets

Please download CMRC 2018 public datasets via the following CodaLab worksheet.
https://worksheets.codalab.org/worksheets/0x92a80d2fab4b4f79a2b4064f7ddca9ce

Submission Guidelines

If you would like to test your model on the hidden test and challenge set, please follow the instructions on how to submit your model via CodaLab worksheet.
https://worksheets.codalab.org/worksheets/0x96f61ee5e9914aee8b54bd11e66ec647/

Open Challenge Leaderboard

Keep track of the latest state-of-the-art systems on CMRC 2018 dataset.
https://hfl-rc.github.io/cmrc2018/open_challenge/

Reference

If you wish to use our data in your research, please cite:

@InProceedings{cui-emnlp2019-cmrc2018,
  author = 	"Cui, Yiming and Liu, Ting and Che, Wanxiang and Xiao, Li and Chen, Zhipeng and Ma, Wentao and Wang, Shijin and Hu, Guoping",
  title = 	"A Span-Extraction Dataset for Chinese Machine Reading Comprehension",
  booktitle = 	"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing",
  year = 	"2019",
  publisher = 	"Association for Computational Linguistics"
}

International Standard Language Resource Number (ISLRN)

ISLRN: 013-662-947-043-2

http://www.islrn.org/resources/resources_info/7952/

Official HFL WeChat Account

Follow Joint Laboratory of HIT and iFLYTEK Research (HFL) on WeChat.

qrcode.png

Contact us

Please submit an issue.