CoAID

Introduction

CoAID (Covid-19 heAlthcare mIsinformation Dataset) is a diverse COVID-19 healthcare misinformation dataset that includes fake news from websites and social platforms, along with users' social engagement with that news. It contains 5,216 news items, 296,752 related user engagements, 958 social platform posts about COVID-19, and ground truth labels.

If you use this dataset, please cite the following paper:

@misc{cui2020coaid,
    title={CoAID: COVID-19 Healthcare Misinformation Dataset},
    author={Limeng Cui and Dongwon Lee},
    year={2020},
    eprint={2006.00885},
    archivePrefix={arXiv},
    primaryClass={cs.SI}
}

History

Version 0.1 (05/17/2020)

  • initial version, corresponding to the arXiv paper

Version 0.2 (08/03/2020)

  • added data from May 1, 2020 through July 1, 2020

Version 0.3 (11/03/2020)

  • added data from July 1, 2020 through September 1, 2020

Version 0.4 (01/08/2021)

  • added data from September 1, 2020 through November 1, 2020