GitHub - nets4geeks/DockerComposeDataset (original) (raw)

A set of semantic dataflow diagrams

It is a set of machine readable data flow diagrams that is based on real multi-component application configurations (Docker compose). The set includes 180 items, and we believe it can be used for various research in the threat modeling field.

The set is based on a domain taxonomy (see services.yml).

Each diagram has three representations (see the 'clear' folder):

The diagrams have been manually classified into five categories related to the web applications and data processing (see 'labels'). This taxonomy (ground truth) allows to define type of a background application, what gives the first view of the security analysis.

Details are here

If you want to refer to the dataset, please, cite this publication:

Brazhuk A. A set of semantic data flow diagrams and its security analysis based on ontologies and knowledge graphs //arXiv preprint arXiv:2303.11198. – 2023.

Also, an enhanced set has been created that includes 200 items (see the clear2, labels2 folder and the services2.yml file).

Моre information about Ontology driven threat modeling is here.