data-dynalearn

Datasets used in the article "Deep learning of contagion dynamics on complex networks"

This repository contains the raw and processed datasets as well as the notebook used to generate the latter.

Number of cases

  • raw file: cases-timeseries-20210327.csv.
  • processed file: cases-timeseries.data.
  • details:
    • column provincia_iso corresponds to the province using the ISO 3166-2:ES naming convention;
    • column fecha indicates the date in format YYYY-MM-DD;
    • column num_casos provides the number of daily new cases.
  • official data from the Centro Nacional de Epidemiología.
  • original file downloaded on 2021-03-27.
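
As a rough illustration of the column layout described above, the following sketch sums the daily new cases per province from rows carrying the three documented columns. The sample rows are made up for illustration and are not taken from the dataset; the function name `total_cases_by_province` is hypothetical, and the comma delimiter is an assumption about the raw file.

```python
import csv
import io
from collections import defaultdict
from datetime import date


def total_cases_by_province(handle):
    """Sum the num_casos column for each provincia_iso value."""
    totals = defaultdict(int)
    for row in csv.DictReader(handle):
        # fecha is documented as YYYY-MM-DD; parsing it raises on malformed dates.
        date.fromisoformat(row["fecha"])
        totals[row["provincia_iso"]] += int(row["num_casos"])
    return dict(totals)


# Made-up sample rows, NOT real data; the comma delimiter is an assumption.
sample = io.StringIO(
    "provincia_iso,fecha,num_casos\n"
    "M,2020-03-01,5\n"
    "M,2020-03-02,7\n"
    "B,2020-03-01,3\n"
)
totals = total_cases_by_province(sample)
print(totals)
```

To apply the same function to the raw file, one would pass an open file handle instead of the in-memory sample, provided the file matches the assumptions stated above.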

Province population sizes

  • raw file: population-data.csv.
  • processed file: population.data.
  • details:
    • column Provincias corresponds to the province name;
    • column Total provides the total population.
  • official data from the Instituto Nacional de Estadística.
  • original file for the year 2019 downloaded on 2020-08-11.

Mobility data

Note: The insular provinces are further subdivided into their islands:

  • 07 Illes Balears: 07_1 Menorca, 07_2 Ibiza and 07_3 Mallorca
  • 35 Las Palmas: 35_1 Lanzarote, 35_2 Fuerteventura and 35_3 Gran Canaria
  • 38 Santa Cruz de Tenerife: 38_1 El Hierro, 38_2 Tenerife, 38_3 La Gomera and 38_4 La Palma
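
The island identifiers listed above share the two-digit code of their province, followed by an underscore and an index. A minimal sketch of how one might map an identifier back to its province is shown below; the helper name `parent_province` is hypothetical, and the code "28" stands in for any identifier without an island suffix.

```python
def parent_province(zone_id):
    """Return the province code for a zone identifier.

    Island identifiers such as "07_1" carry a "_<n>" suffix; identifiers
    without a suffix are returned unchanged.
    """
    return zone_id.split("_", 1)[0]


# "28" is a placeholder for an identifier without an island suffix.
zones = ["07_1", "07_2", "07_3", "35_1", "38_4", "28"]
provinces = [parent_province(z) for z in zones]
print(provinces)
```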

Note: several rows may correspond to the same origin-destination pair, because the original data breaks down journeys by period of the day and by means of transportation:

  • period of the day:
    • P1: 00:00 - 06:00
    • P2: 06:00 - 10:00
    • P3: 10:00 - 17:00
    • P4: 17:00 - 00:00
  • means of transportation:
    • roads: autobús, privado, carretera
    • seaways: barco
    • railways: tren
    • airways: avión
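
Because of this breakdown, rows sharing an origin-destination pair generally need to be summed before use. The sketch below shows one possible way to do so, grouping by the transportation categories listed above and summing over the periods of the day. The field names (`origin`, `destination`, `period`, `mode`, `trips`), the sample rows and the function name `aggregate_trips` are all hypothetical, since the column names of the mobility file are not documented here.

```python
from collections import defaultdict

# Transportation categories as listed in the note above.
MODE_TO_CATEGORY = {
    "autobús": "roads",
    "privado": "roads",
    "carretera": "roads",
    "barco": "seaways",
    "tren": "railways",
    "avión": "airways",
}


def aggregate_trips(rows):
    """Sum trips per (origin, destination, category), over all periods."""
    totals = defaultdict(int)
    for row in rows:
        category = MODE_TO_CATEGORY[row["mode"]]
        totals[(row["origin"], row["destination"], category)] += row["trips"]
    return dict(totals)


# Made-up rows with hypothetical field names, NOT real data.
rows = [
    {"origin": "28", "destination": "08", "period": "P2", "mode": "autobús", "trips": 120},
    {"origin": "28", "destination": "08", "period": "P3", "mode": "privado", "trips": 30},
    {"origin": "28", "destination": "08", "period": "P2", "mode": "tren", "trips": 50},
]
od_totals = aggregate_trips(rows)
print(od_totals)
```

Dropping the category from the grouping key would instead give a single total per origin-destination pair.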

Note: numbers are represented using the Spanish convention where thousands are separated using a comma (e.g., 100,000 denotes one hundred thousand).
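
Values written this way cannot be passed directly to `int`, so the separator has to be removed first. A minimal sketch, assuming the values are whole numbers stored as text (the helper name `parse_count` is hypothetical):

```python
def parse_count(text):
    """Convert a string with comma thousands separators to an integer."""
    return int(text.replace(",", ""))


hundred_thousand = parse_count("100,000")
print(hundred_thousand)
```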

Reference

Deep learning of contagion dynamics on complex networks
Charles Murphy, Edward Laurence and Antoine Allard,
Nature Communications 12, 4720 (2021)