This project is currently in development, stay tuned for documentation. You can get preliminary output in 5 and 10-year age groups in the /Data
folder. This contains columns for Cases
, Deaths
, and Tests
. There may be more populations availale in the unharmonized input database (inputDB.csv
), because some are awaiting corrections or additional inputs .
A short guide to reading the data into R
can be found here. We will add other basic scripts for common data operations, and we will add some Stata
examples too.
You can get the data by downloading the csv
files in the Data/
folder of this repository, or from the OSF
site that we mirror to: https://osf.io/mpwjq/.
Here's an overview of global coverage as of now. A country marked as forthcoming means we've identified a source, but that collection is pending for one reason or another. Are you from one of those countries? Please reach out, if so.
A detailed dashboard of data availability can be found here.
A (preliminary) description of data processing steps, and dash overview of which steps are applied to each country-region-date subset of data can be found here.
This does not yet give an overview of age-group harmonization performed, but we'll add this soon. Further details will be spelled out in a methods protocol, presently in preparation.
Source documentation is presently being gathered and standardized, and we will find a slick way of displaying it when it's further along. All data results are provisional and may change if we decide to change particular sources, or if we change our minds about how to split age groups, etc. So please always use the most up-to-date version.
Most populations in the database contain multiple time snapshots, and all are continually monitored for new data releases. All statistics reported here are cumulative.
If you know of sources for other populations, please either email, Tweet, or leave an Issue in this repository, and we'll look into it. If you would like to assist this project in gathering data, or other tasks, please let us know and I'm sure we can find a task!
José Manuel Aburto, Enrique Acosta, Diego Alburez-Gutierrez, Anna Altová, Ugofilippo Baselini, Simona Bignami, Didier Breton, Jorge Cimentada, Emanuele del Fava, Viorela Diaconu, Jessica Donzowa, Christian Dudel, Toni Froehlich, Alain Gagnon, Mariana Garcia Cristómo, Armando González, Irwin Hecker, Chia Liu, Andrea Lozer, Mădălina Manea, Victor Manuel Garcia Guerrero, Ryohei Mogi, Saskia Morwinsky, Mikko Myrskylä, Marilia Nepomuceno, Natalie Nitsche, Anna Oksuzyan, Emmanuel Olamijuwon, Marius Pascariu, Filipe Ribeiro, Tim Riffe, Silvia Rizzi, Francisco Rowe, Jiaxin Shi, Rafael Silva, Cosmo Strozza, Catalina Torres, Sergi Trias, Fumiya Uchikoshi, Alyson van Raalte, Paola Vasquez, Estevão Vilela, Iván Williams, Virginia Zarulli
(and a few more people at the moment have made commitments. )
Maybe you too? (we could still use a hand!)
Some other databases relevant to this one:
- INED database on COVID-19 deaths by age and sex: https://dc-covid.site.ined.fr/en/
- Global Health 5050 also provides data by age and sex https://globalhealth5050.org/covid19/sex-disaggregated-data-tracker/
- The Short Term Mortality Fluctutions (STMF) database (all cause mortality) www.mortality.org
- Eurostat compiled data on all-cause mortality by weeks demo_r_mweek3
- Our World in Data testing data https://ourworldindata.org/coronavirus-testing
- JHU total cases and deaths https://github.com/CSSEGISandData/COVID-19