This github repository corresponds dataset used for our research article titled FakeCovid- A Multilingual Cross-domain Fact Check News Dataset for COVID-19.
FakeCovid is the first multilingual cross-domain dataset of 7623 fact-checked news articles for COVID-19, collected from 04/01/2020 to 01/07/2020. We have collected the fact-checked articles from 92 fact-checking websites after obtaining references from Poynter and Snopes. We have manually annotated the collected articles into 11 categories of the fact-checked news according to their content. The ultimately generated dataset is in 40 languages from 105 countries.
The work has been accepted in the Workshop on Cyber Social Threats (CySoc 2020) at 14th International Conference on Web and Social Media 2020.
For now, cite ICWSM Workshop paper:
@article{shahifakecovid,
title={FakeCovid-A Multilingual Cross-domain Fact Check News Dataset for COVID-19},
author={Shahi, Gautam Kishore and Nandini, Durgesh}
}
For help or issues using data, please submit a GitHub issue.
For personal communication related to our work, please contact Gautam Kishore Shahi(gautamshahi16@gmail.com
) and Durgesh Nandini(durgeshnandini16@yahoo.in
).
For more update on the related publication on the topic of FakeCovid, please visit https://gautamshahi.github.io/FakeCovid/