/Digital_infrastructures_COVID19_misinformation

Code for the Digital Journalism paper: 'Digital Infrastructures of COVID-19 Misinformation: A New Conceptual and Analytical Perspective on Fact-Checking'. In the paper we compare two fact-checking databases (Poynter institute and Google Fact-check explorer) as overarching infrastructures of fact-checking.

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Digital Infrastructures of COVID-19 Misinformation: A New Conceptual and Analytical Perspective on Fact-Checking

ABSTRACT: Fact-checking databases, as important results of fact checkers’ epistemic work, are increasingly tied together in new overarching infrastructures, but these are understudied and lack transparency despite being an important societal baseline for whether claims are false. This article conceptualizes fact-checking as infrastructure and constructs a mixed-methods approach to examine overlaps and differences and thereby detect biases to increase transparency in COVID-19 misinformation infrastructure at scale. Analyzing Poynter and Google as such overarching infrastructures, we found only a small overlap. Fewer fact-checkers contribute to Google, with fewer stories than to Poynter. 75% of claims in Google are fact-checked by Asian and North American fact-checkers (44% for Poynter) but none by South Americans (20% for Poynter). More stories in Poynter originate from Facebook than outside social media (43% vs. 17%), while Google shows the opposite (16% vs. 38%). In Google, claims originate to a larger extent from public persons. We find similar large topics on “statistics” and “cures,” but also differences regarding smaller topics (e.g., “vaccines”) and types of misinformation (e.g., “virus characteristics”). Thus, the article shows that the infrastructures have inherent biases and argue that making visible such biases will increase transparency for stakeholders using it.

Ida Anthonj Nissen, Jessica Gabriele Walter, Marina Charquero-Ballester & Anja Bechmann (2022) Digital Infrastructures of COVID-19 Misinformation: A New Conceptual and Analytical Perspective on Fact-Checking, Digital Journalism, 10:5, 738-760, DOI: 10.1080/21670811.2022.2026795

Content

File Description
BERT_poynter_googlefc_claim_overlap_github.py Compares the claim titles from the two databases using sentence embedding (BERT) to find overlapping claims
BERT_poynter_googlefc_TopicModel_github.py Automated topic model using BERT to map the topics occuring in the fact-checked stories for both databases