/FewFC

Creative Commons Attribution Share Alike 4.0 InternationalCC-BY-SA-4.0

FewFC

  • Few-shot Financial Chinese event extraction datase

Important

  • In order to accelerate the research on few-shot event extraction, incremental event extraction and low resources event extraction, we introduce FewFC, a new Chinese Financial sentence-level event extraction
  • dataset constructed from news reports on the Internet and announcements issued by listed companies. This data set is also used for CCKS TASK 3 evaluation.

Details

The data set contains 10 financial field event types and 8982 sentences.

Event_Definition.pdf

  • Explains the definition of event types and event roles

Rearranged Dataset

train_base.json

  • Training set from source domain with 5 event types.

test_base.json

  • Test set from source domain.

train_trans.json

  • Training set from target domain.

test_trans.json

  • Test set from target domain.

Original Dataset

  • Data divided according to the CCKS evaluation

train_base.json

  • Training set from source domain in Phase I Evaluation. Same as train_base.json in rearranged dataset.

train_trans_2_type.json

  • Training set from target domain with only 2 event types in Phase I Evaluation. Part of the train_trans.json in rearranged dataset.

dev_base.json

  • Development set from source domain in Phase I Evaluation. Same as test_base.json in rearranged dateset.

dev_trans.json

  • Developement set from target domain with only 2 event types in Phase I Evaluation. Part of the test_trans.json in rearranged dataset.

train_trans_5_type.json

  • Training set from target domain with all 5 event types in Phase II Evaluation. Same as train_trans.json in rearranged dataset.

test_trans.json

  • Test set from target domain with all 5 event types. Part of the test_trans.json in rearranged dataset.

Partial Dataset

  • Data division used in the paper “What the role is vs. What plays the role:Semi-supervised Event Argument Extraction via Dual Question Answering

Cite

If you use the dataset or the code, please cite this paper:

@inproceedings{Yang2021,
  author    = {Yang Zhou and
               Yubo Chen and
               Jun Zhao and
               Yin Wu and
               Jiexin Xu and
               Jinlong Li
               },
  title     = {What the role is vs. What plays the role: Semi-supervised Event Argument Extraction via Dual Question Answering},
  booktitle = {Proceedings of AAAI-21},
  publisher = {{AAAI} Press},
  year      = {2021},
}

License

Copyright: This data set is released by the nlp group of the Institute of Automation of the Chinese Academy of Sciences, collected by China Merchants Bank, licensed under CC BY-SA 4.0.