A set of semantic dataflow diagrams

It is a set of machine readable data flow diagrams that is based on real multi-component application configurations (Docker compose). The set includes 180 items, and we believe it can be used for various research in the threat modeling field.

The set is based on a domain taxonomy (see services.yml).

Each diagram has three representations (see the 'clear' folder):

YAML file that can be used to draw a picture (a visual view of a diagram) or might be converted to another format,
OWL (Web Ontology Language) ontology that can be used to reason extra facts about the diagram,
RDF (Resource Description Framework) graph that contains ‘full’ knowledge about the diagram and can be questioned about various aspects of design and security by SPARQL requests (see 'sparql/requests').

The diagrams have been manually classified into five categories related to the web applications and data processing (see 'labels'). This taxonomy (ground truth) allows to define type of a background application, what gives the first view of the security analysis.

Details are here

If you want to refer to the dataset, please, cite this publication:

Brazhuk A. A set of semantic data flow diagrams and its security analysis based on ontologies and knowledge graphs //arXiv preprint arXiv:2303.11198. – 2023.

Also, an enhanced set has been created that includes 200 items (see the clear2, labels2 folder and the services2.yml file).

Моre information about Ontology driven threat modeling is here.

nets4geeks/DockerComposeDataset

A set of semantic dataflow diagrams