/Deceiver

Multilingual discourse-annotated dataset for fake news detection

Apache License 2.0Apache-2.0

DOI

RST-Annotated Dataset for Multilingual Fake News Detection


Statements that are intentionally misstated (or manipulated) are of considerable interest to researchers, government, security, and financial systems. Furthermore, there are reliable cues for detecting deception, and the belief that liars give off cues that may indicate their deception is near-universal. Therefore, corroborating that deceiving actions require advanced cognitive development that honesty simply does not require, as well as people’s cognitive mechanisms have promising guidance for deception detection, we propose the first multilingual corpus composed of fake and true news annotated with the Rhetorical Structure Theory (RST) framework. Considering that our work is the first to exploit multilingual discourse-aware strategies for fake news detection, the research community currently lacks multilingual RST-annotated corpora for fake news detection.

FUNDING

SSC-logo-300x171