CoAID (Covid-19 heAlthcare mIsinformation Dataset) is a diverse COVID-19 healthcare misinformation dataset, including fake news on websites and social platforms, along with users' social engagement about such news. It includes 4,251 news, 296,000 related user engagements, 926 social platform posts about COVID-19, and ground truth labels.
If you use this dataset, we appreciate it if you cite the following paper:
@misc{cui2020coaid,
title={CoAID: COVID-19 Healthcare Misinformation Dataset},
author={Limeng Cui and Dongwon Lee},
year={2020},
eprint={2006.00885},
archivePrefix={arXiv},
primaryClass={cs.SI}
}
Version 0.1 (05/17/2020)
- initial version corresponding to arXiv paper
Version 0.2 (08/03/2020)
- added data from May 1, 2020 through July 1, 2020
Version 0.3 (11/03/2020)
- added data from July 1, 2020 through September 1, 2020