HGX data

Collection of datasets associated with the library Hypergraphx for higher-order network analysis in Python.

Sources

We collected several hypergraph datasets from different sources. The source of the data is acknowledged on the respective presentation page and we provide the BibTeX for a correct reference in your articles (please write us or provide a correction when needed).

Most of the time, source data is naturally a hypergraph. However, sometimes hypergraphs need to be inferred from data stored as a collection of pairwise interactions. This operation is debatable and not trivial, therefore in this cases we also provide original pairwise data.

Face-to-face interactions data

High school

Contacts and friendship relations between students in a high school in Marseilles, France, in December 2013.

Primary school

Contacts between the children and teachers in a primary school

Hospital

Contacts between patients and healthcare workers in a hospital ward in Lyon, France, from Monday, December 6, 2010 at 1:00 pm to Friday, December 10, 2010 at 2:00 pm.

Workplace

Contacts between individuals measured in an office building in France, from June 24 to July 3, 2013.

Conference (Hypertext)

Face-to-face interactions during ACM Hypertext 2009 conference in Torino, Italy (June 29 - July 1, 2009).

Conference (SFHH)

Face-to-face interactions during SFHH conference in Nice, France (June 4-5, 2009).

E-mails

Enron

Dataset is provided by Chodrow and Mellor (2020).

EU

Dataset is from SNAP (Leskovec and Krevl 2014).

Citations

Dm

Subsets of a DBLP citation dataset (Sinha et al. 2015). The subsets consist of papers published in the venues of data mining.

Software

Subsets of a DBLP citation dataset (Sinha et al. 2015). The subsets consist of papers published in the venues of software engineering.

Bitcoin

2014

The original dataset are provided by Wu et al. (2021), and it contains frst 1,500,000 transactions in 11/2014.

2015

The original dataset are provided by Wu et al. (2021), and it contains frst 1,500,000 transactions in 06/2015.

2016

The original dataset are provided by Wu et al. (2021), and it contains frst 1,500,000 transactions in 01/2016.

Question & Answer

Math

Log data of a question answering site, stack exchange, provided at Archive (2022). We choose math-overfow, which covers mathematical questions.

Server

Log data of a question answering site, stack exchange, provided at Archive (2022). We choose server-fault, which treats server related issues.

Metabolic

iaf1260b

Dataset iAF1260b provided by Yadati et al. (2020).

iJO1366

Dataset iJO1366 provided by Yadati et al. (2020).