FewFC

Few-shot Financial Chinese event extraction datase

Important

In order to accelerate the research on few-shot event extraction, incremental event extraction and low resources event extraction, we introduce FewFC, a new Chinese Financial sentence-level event extraction
dataset constructed from news reports on the Internet and announcements issued by listed companies. This data set is also used for CCKS TASK 3 evaluation.

Details

The data set contains 10 financial field event types and 8982 sentences.

Event_Definition.pdf

Explains the definition of event types and event roles

Rearranged Dataset

train_base.json

Training set from source domain with 5 event types.

test_base.json

Test set from source domain.

train_trans.json

Training set from target domain.

test_trans.json

Test set from target domain.

Original Dataset

Data divided according to the CCKS evaluation

train_base.json

Training set from source domain in Phase I Evaluation. Same as train_base.json in rearranged dataset.

train_trans_2_type.json

Training set from target domain with only 2 event types in Phase I Evaluation. Part of the train_trans.json in rearranged dataset.

dev_base.json

Development set from source domain in Phase I Evaluation. Same as test_base.json in rearranged dateset.

dev_trans.json

Developement set from target domain with only 2 event types in Phase I Evaluation. Part of the test_trans.json in rearranged dataset.

train_trans_5_type.json

Training set from target domain with all 5 event types in Phase II Evaluation. Same as train_trans.json in rearranged dataset.

test_trans.json

Test set from target domain with all 5 event types. Part of the test_trans.json in rearranged dataset.

Partial Dataset

Data division used in the paper “What the role is vs. What plays the role:Semi-supervised Event Argument Extraction via Dual Question Answering”

Cite

If you use the dataset or the code, please cite this paper:

@inproceedings{Yang2021,
  author    = {Yang Zhou and
               Yubo Chen and
               Jun Zhao and
               Yin Wu and
               Jiexin Xu and
               Jinlong Li
               },
  title     = {What the role is vs. What plays the role: Semi-supervised Event Argument Extraction via Dual Question Answering},
  booktitle = {Proceedings of AAAI-21},
  publisher = {{AAAI} Press},
  year      = {2021},
}

License

Copyright: This data set is released by the nlp group of the Institute of Automation of the Chinese Academy of Sciences, collected by China Merchants Bank, licensed under CC BY-SA 4.0.

TimeBurningFish/FewFC

FewFC

Important

Details

Rearranged Dataset

Original Dataset

Partial Dataset

Cite

License