EventKG+Click is a novel cross-lingual dataset that reflects the language-specic relevance of events and their relations. This dataset aims to provide a reference source to train and evaluate novel models for event-centric cross-lingual user interaction, with a particular focus on the models supported by knowledge graphs. EventKG+Click Dataset is based on two data sources:
- the Wikipedia clickstream that reflects real-world user interactions with events and their relations within language-specic Wikipedia editions; and
- the EventKG knowledge graph that contains semantic information regarding events and their relations that partially originates from Wikipedia.
EventKG+Click consists of two subsets:
-
EventKG+Click_event which contains relevance scores, location-closeness, recency and Wikipedia link count factors for more than 4 thousand events; and
-
EventKG+Click_relation with nearly 10 thousand event-centric click-through pairs, and their langugae specific number of clicks, relation relevance and co-mentions of the relation which is the number of sentences in whole Wikipedia language editions that mentions both the source and target.
You can find a complete step by step walkthrough the process of EventKG+Click creation here.
This work is licensed under a Creative Commons Attribution 4.0 International License.
If you find EventKG+Click dataset useful for your research, please consider citing the following paper:
@article{abdollahieventkg+,
title={EventKG+ Click: A Dataset of Language-specific Event-centric User Interaction Traces},
author={Abdollahi, Sara and Gottschalk, Simon and Demidova, Elena}
}