/covid_age

COVID-19 cases and deaths by age and sex

Primary LanguageR

COVerAGE-DB

COVID-19 cases, deaths, and tests by age and sex

This project is currently in development, stay tuned for documentation. You can get preliminary output in 5 and 10-year age groups in the /Data folder. This contains columns for Cases, Deaths, and Tests. There may be more populations availale in the unharmonized input database (inputDB.csv), because some are awaiting corrections or additional inputs .

Getting started

A short guide to reading the data into R can be found here. We will add other basic scripts for common data operations, and we will add some Stata examples too.

Data availability

You can get the data by downloading the csv files in the Data/ folder of this repository, or from the OSF site that we mirror to: https://osf.io/mpwjq/.

Here's an overview of global coverage as of now. A country marked as forthcoming means we've identified a source, but that collection is pending for one reason or another. Are you from one of those countries? Please reach out, if so. coverage map

A detailed dashboard of data availability can be found here.

Data processing steps

A (preliminary) description of data processing steps, and dash overview of which steps are applied to each country-region-date subset of data can be found here.

This does not yet give an overview of age-group harmonization performed, but we'll add this soon. Further details will be spelled out in a methods protocol, presently in preparation.

Documentation

Source documentation is presently being gathered and standardized, and we will find a slick way of displaying it when it's further along. All data results are provisional and may change if we decide to change particular sources, or if we change our minds about how to split age groups, etc. So please always use the most up-to-date version.

Notes

Most populations in the database contain multiple time snapshots, and all are continually monitored for new data releases. All statistics reported here are cumulative.

If you know of sources for other populations, please either email, Tweet, or leave an Issue in this repository, and we'll look into it. If you would like to assist this project in gathering data, or other tasks, please let us know and I'm sure we can find a task!

The team (so far), alphabetical by last name

José Manuel Aburto, Enrique Acosta, Diego Alburez-Gutierrez, Anna Altová, Ugofilippo Baselini, Simona Bignami, Didier Breton, Jorge Cimentada, Emanuele del Fava, Viorela Diaconu, Jessica Donzowa, Christian Dudel, Toni Froehlich, Alain Gagnon, Mariana Garcia Cristómo, Armando González, Irwin Hecker, Chia Liu, Andrea Lozer, Mădălina Manea, Victor Manuel Garcia Guerrero, Ryohei Mogi, Saskia Morwinsky, Mikko Myrskylä, Marilia Nepomuceno, Natalie Nitsche, Anna Oksuzyan, Emmanuel Olamijuwon, Marius Pascariu, Filipe Ribeiro, Tim Riffe, Silvia Rizzi, Francisco Rowe, Jiaxin Shi, Rafael Silva, Cosmo Strozza, Catalina Torres, Sergi Trias, Fumiya Uchikoshi, Alyson van Raalte, Paola Vasquez, Estevão Vilela, Iván Williams, Virginia Zarulli

(and a few more people at the moment have made commitments. )

Maybe you too? (we could still use a hand!)

See also

Some other databases relevant to this one:

Age-structured, also related to COVID-19 directly

Total counts, interesting to relate or compare