This repository contains resources related to COVID-19. This data repository is maintained by DMML lab at Arizona State University.
- covid-19-social-science-research:This resource is designed to help us track new social research about COVID 19, including published findings, pre-prints, projects underway, and projects that are at least at a solid proposal stage.
- CORD-19: CORD-19 is a resource of over 44,000 scholarly articles, including over 29,000 with full text, about COVID-19, SARS-CoV-2, and related coronaviruses.
- LitCovid: a curated literature hub for tracking up-to-date scientific information about the 2019 novel Coronavirus.
- nCovMemory-en: This is a repository dedicated to the English translations of the relevant news reports on the 2019-nCoV outbreak and the resulting epidemic of the Novel Coronavirus Pneumonia (NCP) in China.
- PoliticalFact fact checked: PolitiFact has fact-checked a lot of popular social media posts about the virus, including fake coronavirus cures, false news reports and conspiracy theories about the spread.
- Coronavirus Misinformation Tracking Center: All the news and information sites in the U.S., the U.K., France, Italy, and Germany that published materially false information about the virus found by NewsGuard .
- Malicious URLs: These URLs are malicious urls checked by NewsGuard.
- COVID-19 coronavirus news articles: Database of archived COVID-19 coronavirus news articles. The articles are archived on archive.org and archive.today servers. Be patient!
- COVID-19 Television Coverage Dataset: A New Dataset For Exploring The Coronavirus Narrative On Television News.
- FEMA: Help the public distinguish between rumors and facts regarding the response to coronavirus (COVID-19) pandemic.
- Defense Department: fact and fiction collected by state and local government.
- UALR-Known Misinformation: Known misinformation about coronavirus.
- #CoronaVirusFacts Alliance: the #CoronaVirusFacts/#DatosCoronaVirus Alliance unites more than 100 fact-checkers around the world in publishing, sharing and translating facts surrounding the new coronavirus.
- covid19-misinfo-data: this repository contains Covid19-scientific (CDC, WHO and MedicalNewsToday) and Covid19-politifact (PolitiFact) fact checked claims.
- COVID-19-TweetIDs: The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced on January 28, 2020.
- Coronavirus Tweet Ids: This dataset contains the tweet ids of 51,798,932 tweets related to Coronavirus or COVID-19. They were collected between March 3, 2020 and March 19, 2020 (midnight UTC-0) from the Twitter API using Social Feed Manager.
- Crowdbreaks: Tweets with keywords that related to specific health topics. The system is designed to track trends about health and disease-related issues in real-time across different countries.
- Covid-19: Keywords filtered tweet(coronavirus , 2019nCoV and etc) from February 11th. This dataset contains the tweets and retweets.
- COVID-19 Real World Worry Dataset: Measuring Emotions in the COVID-19 Real World Worry Dataset.
- COVID-19 Infodemic Twitter Dataset: Tweets annotated with fine-grained labels related to disinformation about COVID-19. The labels answer seven different questions that are of interests to journalists, fact-checkers, social media platforms, policymakers, and society as a whole. There are annotations for Arabic and English.
- CoVaxxy: A collection of English-language Twitter posts about COVID-19 vaccines.
- COVID-19 Mobility Monitoring project: Mobility data in Italy. Location is collected anonymously from opted in users through smartphone applications.
- Baidu Mobility Data: The data is scraped from Baidu Migration website.
- Geographic Distribution of COVID-19 cases worldwide: The data file is updated daily and contains the latest available public data on COVID-19.
- Apple Mobility Trends Reports: COVID‑19 mobility trends in countries/regions and cities. Reports are published daily and reflect requests for directions in Apple Maps.
- Covid-19 Community Mobility Reports: This dataset provides insights into what has changed in response to policies aimed at combating COVID-19. The reports chart movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential.
-
Dati COVID-19 Italia: Cases in Italia.
-
CSSE COVID-19 Dataset: Daily case reports
-
Novel Corona Virus 2019 Dataset: This dataset has daily level information on the number of affected cases, deaths and recovery from 2019 novel coronavirus. Please note that this is a time series data and so the number of cases on any given day is the cumulative number.
-
nCoV2019: Individual-level data from national, provincial, and municipal health reports, as well as additional information from online reports. All data are geo-coded and, where available, include symptoms, key dates (date of onset, admission, and confirmation), and travel history.
-
COVID-19_US_County-level_Summaries: We gather a machine readable dataset related to socioeconomic factors that may affect the spread and/or consequences of epidemiological outbreaks, particularly the novel coronavirus (COVID-19).
-
C3.ai COVID-19 Data Lake: A unified, open data image of critical COVID-19 data publicly available at no cost to the global research community beginning on April 13, 2020.
-
[US COVID-19 Daily Cases with Basemap]:(https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/HIDLTK): It contains COVID-19 Daily Cases with US basemap, including state and county-level data. However, for the county-level recovered data, it is available till March 17, 2020.
-
China Health Facilities :Health facilities POI, such as the hospital in China.
-
COVID-19 Metadata: A collection of relevant country/city level metadata about the COVID-19 pandemic, made interoperable for secondary analysis.
-
NY COVID-19: The most recent information collected about people who have tested positive for COVID-19 in NYC.
- California COVID-19 Hospital Data and Case Statistics: California COVID-19 Hospital Data and Case Statistics.
- delphi-epidata: COVID-19 activity level across the U.S. These indicators are derived from a variety of anonymized, aggregated data sources made available by multiple partners.
- data-covid19: The scope of the data set includes the epidemic situation, scientific research, knowledge graph, media information and other aspects.
- α-Satellite: An AI-driven System and Benchmark Datasets for Hierarchical Community-level Risk Assessment to Help Combat COVID-19.
- Coronavirus Knowledge Hub: It provides an up-to-date source of trusted information and analysis on COVID-19 and coronaviruses, including the latest research articles, information, and commentary from our world-class scientific community.
- COVID-19 GIS Hub: Get maps, datasets, applications, and more for coronavirus disease 2019 (COVID-19).
- Google COVID-19: a repository of public datasets like Johns Hopkins Center for Systems Science and Engineering (JHU CSSE), the US Census Bureau's American Community Survey (ACS), and OpenStreetMaps data.
- Amazon COVID-19 Data Lake: It contains COVID-19 case tracking data from Johns Hopkins and The New York Times, hospital bed availability from Definitive Healthcare, and over 45,000 research articles about COVID-19 and related coronaviruses from the Allen Institute for AI.
- COVID-19 Pandemic: COVID-19 Pandemic in Locations with a Humanitarian Response Plan from WHO.
- Coronavirus COVID-19 (2019-nCoV) Epidemic Datasets :Provide the research community with a unified data hub by collecting worldwide fine-grained data merged with demographics, air pollution, and other exogenous variables helpful for a better understanding of COVID-19.
- COVID-19 Public Repository Data: A comprehensive versioned dataset of the repositories and relevant related metadata about public projects hosted on GitHub related to the 2019 Novel Coronavirus and associated COVID-19 disease.
- CoronaQs : FAQs dataset: HTML renderable dataset of FAQs with label collected from various trusted resources like government, UN, WHO etc.
- Policies and Regulations Timeline :Policies and regulations released by the Chinese government, global organizations, western countries, and so on.
- stayinghomeclub: List of companies that are taking steps to address the spread of COVID-19.
- World Bank Indicators of Interest of the COVID-19 Outbreak
- SARS-CoV-2 Sequences: this dataset lists the SARS-CoV-2 sequences curently available in GenBank and the Sequence Read Archive (SRA).
- Genomic epidemiology of novel coronavirus: Genomic epidemiology of novel coronavirus - Global subsampling.
- Postman COVID-19 API Resource Center: Postman provides a list of API for information exchange.
- Definitive Healthcare Public COVID-19 Data Repository: hospital bed availability.
- #COVID19 Government Measures Dataset: The #COVID19 Government Measures Dataset puts together all the measures implemented by governments worldwide in response to the Coronavirus pandemic. Data collection includes secondary data review. The researched information available falls into five categories: Social distancing, Movement restrictions, Public health measures, Social and economic measures, Lockdowns.
Please feel free to send me pull requests or email (yichuan1@asu.edu) / (dmahudes@asu.edu) / (kai.shu@asu.edu) to add resources.