HyperGraphs.jl

The main aim of this package is to implement concepts of graph theory on hypergraphs. At the most basic level, it allows to represent high-order relationships between objects, with complete freedom to choose the type of these objects. A secondary aim follows from the realisation that most flavours of (hyper)graphs are a specific case of oriented, weighted hypergraphs; from this, it should be possible to define all methods at the hypergraph level and to programmatically specialise them for other (hyper)graph types.

Currently implemented are fundamental constructors for unoriented, unweighted hypergraphs (via the HyperGraph and HyperEdge types) and for a specific case of oriented, weighted hypergraphs: chemical hypergraphs (via the ChemicalHyperGraph and ChemicalHyperEdge types), as well as a set of functions that allow to modify hypergraphs (adding and deleting hyperedges and vertices). Flavours of hypergraphs and hyperedges are implemented with traits via SimpleTraits.jl; traits implemented so far are IsOriented and IsWeighted.

For some example applications of HyperGraphs.jl, check out its sibling package Simulacrum.jl.

Notation and naming rules

In the code, x and xs refer to one and several hypergraphs, respectively; the same applies for e and es with hyperedges, and for v and vs with vertices.

The current idea to allow for natural extension of the core functions is to respect a set of standard field names when defining a new custom concrete type. If custom field names are needed, these should be explicitly connected to core methods (vertices and hyperedges mainly). Currently, field names should be:

V, the set of vertices in a hypergraph and a hyperedge
E, the set of hyperedges in a hypergraph
src, the set of source vertices in an oriented hyperedge
tgt, the set of target vertices in an oriented hyperedge
objs, the objects of an incidence
mults, the multiplicities of the objects of an incidence
w, the weight of a weighted hyperedge

Then, custom names are built on top e.g. rate(e::ChemicalHyperEdge) = weight(e) (which is already implemented).

These symbols were chosen to differentiate hypergraphs from graphs (all graphs are hypergraphs but not all hypergraphs are graphs), and so a hypergraph is defined as X = (V, E). An alternative option was to name hypergraphs with the letter H; however, following that logic would have meant changing the symbol for hyperedges too, which would have been impractical (also H). This way, using X emphasises the difference between hypergraphs and graphs and using E for edges retains some continuity with ordinary graphs notation G = (V, E).

Functions are defined on hypergraphs x and edges e for consistency. It may be useful to add some clarity and differentiate between variables though, and so a chemical hypergraph may be defined as chx and a chemical hyperedge as che.

Some notes on the code

A word of caution: this is a work in progress, and so functions have not been perfectly proofread for correctness (especially those in operations, neighbor functions, parallel and multi-hyperedges, loops...). E.g. currently, it is possible to add hyperedges to a hypergraph that does not have all the vertices in said hyperedges in its vertex set; this will be prevented from happening eventually.

Each graph flavour should be implemented with performance and ease of interfacing (i.e. of accessing information) in mind: there is no need to carry redundant information just because the mathematical syntax does. Hopefully this is true in the current code, but some improvements are definitely possible.

Implementation notes

Most of these will hopefully end up in the documentation.

Core

The source and the target of an oriented hyperedge are also referred to as head and tail, but the former notation is more explicit.

Default values

Default field values are implemented by overloading Base.getproperty. This means object.some_field can return a value despite object having no corresponding :some_field field.

So far, objects of type AbstractHyperEdge have a default :weight value of 1.

Chemical hypergraphs and chemical hyperedges

Chemical hypergraphs represent reaction networks but are rooted in graph theory. The main reference is probably [Jost2019].

In this implementation, I took the liberty to name chemical hyperedges the reactions, following the same logic as naming chemical hypergraph the hypergraph that represents a system of reactions (this is because in each case it is a specific case of hypergraph and hyperedge, namely one where the hyperedge incidence multiplicity is restricted to the positive integers, thus representing stoichiometries).

Graph theoretical concepts have natural interpretations in the context of chemical reactions represented on chemical hypergraphs. For instance, a vertex-hyperedge incidence multiplicity represents how many times a vertex occurs in a hyperedge and as such encodes stoichiometry information, while a hyperedge-hypergraph incidence multiplicity encodes for parallel hyperedges (this is not currently implemented).

Constructors set 1 as the default value for reactions rate and for stoichiometries; this is to simplify the syntax. For instance, all these calls are equivalent: SpeciesSet(["X"]), SpeciesSet("X"), SpeciesSet("X", 1), SpeciesSet(["X"], [1]). Compared to other implementations of reaction networks, it is also easier to specify a different stoichiometry for only part of the reaction (e.g. if one needs only some of the reactants to have stoichiometries different from 1, only the reactant stoichiometries have to be specified and not those of all the species involved).

Properties

Vertex degree

The usual definition of the degree of a vertex v is the number of edges incident on v; this definition however breaks down in the case of a loop: following it strictly implies a loop has degree 1 (a loop is one edge incident on one vertex) when it is generally agreed it has degree 2 [Bretto2013], [Bollobas1998], [Kaminski2019], [Zaslavsky1982].

A more general definition is then probably the number of incidences at v, as given by Zaslavsky [Zaslavsky1982]. Following this more general definition gives us degree 2 for a loop (since a loop on v is twice incident on v [Zaslavsky1982]) while working in the same way as the former definition in other cases.

Here we implement an even more general definition of degree: the degree of vertex v is defined as the sum of the weights of edges incident on v, where the weight of edge e appears according to the multiplicity of v in e. In other words, the degree of v is given by the sum over edges of edge weight times the number of incidences of v in each edge. Other definitions discussed above are special cases of this more general definition.

This however means that degree may now return values that are not of type Number, depending on what type edge weights are. This may cause unexpected behaviours since some concepts rely on e.g. maximum or sum being defined on degree values. Those effectively assume that degrees are defined on the positive integers including zero.

The vertex degree is also referred to as valency. Additionally, the degree of a graph is its maximum vertex degree [Zhu2019], which is not implemented here to avoid confusion.

Edge cardinality

The cardinality of an edge is its number of endpoints; this can be interpreted in different ways, which becomes obvious when working with loops. If the vertices of an edge are treated as a multiset or as an n-tuple (e.g. when working with oriented edges), a loop should have cardinality 2. This comes from the fact that a loop has two (coinciding) endpoints ([Zaslavsky1982], [Zaslavsky1991]), with the cardinality of a multiset being the sum of the multiplicities of its elements. If it is instead treated as a set, a loop has cardinality 1, as is often defined (e.g. in [Dorfler1980], [Berge1989], [Bretto2013]). Concisely, a loop has cardinality 2 when counting its number of endpoints; 2 when counting the number of elements of a multiset / n-tuple; 1 when counting the number of elements of a set. This seems to be what is suggested on page 4 of [Spivak2009]: the map that sends the set of non-empty tuples on a set S (i.e. edges) to the set non-empty subsets of S (i.e. vertices) "may decrease cardinality." The approach taken here is to adopt the most general definition (i.e. considering the vertices of edges as a multiset / an n-tuple) which then leaves freedom to build more specialised functions on top.

Hyperedge cardinality is also referred to as order [Zhu2019], and as size. Neither of these are implemented; the former may be confused with the order of a hypergraph, and the latter may conflict with Base.size.

Other properties

The size of a hypergraph is defined in [Gallo1993] and in [Cambini1997]; this is implemented as hypergraph_size to avoid confusion with Base.size again.

The order of a hypergraph is its number of vertices [Wang2018], [Zaslavsky2010]. (Also note that order is used to refer to the maximum cardinality of a hypergraph in [Zhu2019], but this use seems unusual.)

Note that the rank of a reaction network is the maximum number of linearly independent reactions [Shinar2010], which may be confused with the rank of a hypergraph.

A reference for the volume of a hypergraph is [Kaminski2019].

Operations

Functions only do what their name implies, and so if a loop is undesirable in a specific application, one must check that e.g. the hyperedge that is about to be added to a hypergraph is not a loop.

Weak deletion only deletes incidences, whereas strong deletion deletes any object incident on the object(s) being deleted; this means that e.g. weak vertex deletion removes any occurrence of the given vertex in the hyperedges incident on that vertex but does not delete those hyperedges [Chen2018], [Rusnak2013]. Note: vertex deletion is (incidence) dual to edge deletion [Rusnak2013].

Note: currently, the internal function _del_vertex!(e) must have two methods: one for unoriented and one for oriented hyperedges; this is because the former method only needs to remove a vertex from vertices(e) when the latter needs to remove it from both the source and target sets (and both objects and multiplicities), which does not naturally work with vertices of an oriented hyperedge. This is somewhat messy; ideally, as mentioned above, methods would be built automatically from the hyperedge flavour (here oriented vs. unoriented), via traits.

Edge switching is implemented at a high level by swapping the source and target sets of an oriented hyperedge. Switching is more fundamentally defined as the negation of incidences, e.g. in [Reff2012], [Rusnak2013].

Future developments

This is mainly a personal repository to play around with hypergraphs, but I do plan to add more functionalities over time.

Note a potential future breaking change about the behaviour of src and tgt: these currently return the objects of the set of incidences but may return the set of incidences themselves in the future, depending on what makes sense. This also means that currently, the extension of Base.== does not check for equal multiplicity (which may be slightly incorrect).

References

[Berge1989] -- Berge, C. (1989). Hypergraphs: combinatorics of finite sets. Amsterdam, New York, Oxford, Tokyo: Elsevier B.V.

[Bollobas1998] -- Bollobás, B. (1998). Modern Graph Theory. Graduate Texts in Mathematics (Vol. 184). New York, NY: Springer. [doi]

[Bretto2013] -- Bretto, A. (2013). Hypergraph Theory: An Introduction. Mathematical Engineering (Vol. 11). Heidelberg: Springer. [doi]

[Burgio2020] -- Burgio, G., Matamalas, J. T., Gómez, S., & Arenas, A. (2020). Evolution of Cooperation in the Presence of Higher-Order Interactions: from Networks to Hypergraphs. Entropy. [doi]

[Cambini1997] -- Cambini, R., Gallo, G., & Scutellà, M. G. (1997). Flows on hypergraphs. Mathematical Programming, Series B, 78(2), 195–217. [doi]

[Chen2018] -- Chen, G., Liu, V., Robinson, E., Rusnak, L. J., & Wang, K. (2018). A characterization of oriented hypergraphic Laplacian and adjacency matrix coefficients. Linear Algebra and Its Applications, 556, 323–341. [doi]

[Dorfler1980] -- Dörfler, W., & Waller, D. A. (1980). A category-theoretical approach to hypergraphs. Archiv Der Mathematik, 34(1), 185–192. [doi]

[Gallo1993] -- Gallo, G., Longo, G., Pallottino, S., & Nguyen, S. (1993). Directed hypergraphs and applications. Discrete Applied Mathematics, 42(2–3), 177–201. [doi]

[Hellmuth2012] -- Hellmuth, M., Ostermeier, L., & Stadler, P. F. (2012). A Survey on Hypergraph Products. Mathematics in Computer Science, 6(1), 1–32. [doi]

[Jost2019] -- Jost, J., & Mulas, R. (2019). Hypergraph Laplace operators for chemical reaction networks. Advances in Mathematics, 351, 870–896. [doi]

[Kaminski2019] -- Kamiński, B., Poulin, V., Prałat, P., Szufel, P., & Théberge, F. (2019). Clustering via hypergraph modularity. PLoS ONE, 14(11), 1–15. [doi]

[Reff2012] -- Reff, N., & Rusnak, L. J. (2012). An oriented hypergraphic approach to algebraic graph theory. Linear Algebra and Its Applications, 437(9), 2262–2270. [doi]

[Rusnak2013] -- Rusnak, L. J. (2013). Oriented hypergraphs: Introduction and balance. Electronic Journal of Combinatorics, 20(3), 1–29. [doi]

[Shinar2010] -- Shinar, G., & Feinberg, M. (2010). Structural sources of robustness in biochemical reaction networks. Science, 327(5971), 1389–1391. [doi]

[Spivak2009] -- Spivak, D. I. (2009). Higher-dimensional models of networks. arXiv, 1–18. Retrieved from http://arxiv.org/abs/0909.4314

[Wang2018] -- Wang, L., Egorova, E. K., & Mokryakov, A. V. (2018). Development of Hypergraph Theory. Journal of Computer and Systems Sciences International, 57(1), 109–114. [doi]

[Zaslavsky1982] -- Zaslavsky, T. (1982). Signed graphs. Discrete Applied Mathematics, 4(1), 47–74. [doi]

[Zaslavsky1991] -- Zaslavsky, T. (1991). Orientation of Signed Graphs. European Journal of Combinatorics, 12(4), 361–375. [doi]

[Zaslavsky2010] -- Zaslavsky, T. (2010). Matrices in the Theory of Signed Simple Graphs. arXiv, 1–20. Retrieved from http://arxiv.org/abs/1303.3083

[Zhu2019] -- Zhu, H., & Masahito H. (2019). Efficient verification of hypergraph states. Physical Review Applied, 12(5), 054047. [doi]

zsteve/HyperGraphs.jl