Cluster-Editing-PACE-2021-instances

Instances of PACE 2021 and scripts to download and convert the source data sets

Primary language: Python. License: Creative Commons Zero v1.0 Universal (CC0-1.0).

Software prerequisites:

  • Required: g++, python3

Python packages

  • Required python3 packages: numpy, matplotlib
  • Optional python3 packages (creating some datasets may fail without these): sklearn (distributed on PyPI as scikit-learn), pandas, tqdm

Python libraries can be installed system-wide (as root) using

`pip3 install <packages>`

or as a user

`pip3 install --user <packages>`
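
Before running the setup, it can help to check which of the packages listed above are already importable. The following is a minimal sketch and not part of the repository; the helper name `missing_packages` is hypothetical, and the package lists simply mirror the ones given above.

```python
import importlib.util

# Import names taken from the lists above (scikit-learn is imported as "sklearn").
REQUIRED = ["numpy", "matplotlib"]
OPTIONAL = ["sklearn", "pandas", "tqdm"]


def missing_packages(names):
    """Return the names that cannot be located for import (hypothetical helper)."""
    return [name for name in names if importlib.util.find_spec(name) is None]


print("Missing required packages:", missing_packages(REQUIRED))
print("Missing optional packages:", missing_packages(OPTIONAL))
```

An empty list means every package in that group was found; anything listed can be installed with one of the `pip3` commands above.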

Usage

The datasets can be downloaded and converted using the following command:

`cd data && ./setup.sh`

This creates all weighted and unweighted graphs from the various data sources. The datasets are stored in subfolders of the data/ folder. Weighted instances are similarity matrices stored as CSV files; lines starting with # in these files are comments. Unweighted instances are stored as .gr files, which are edge lists in the DIMACS format used by PACE. The setup script additionally normalizes the similarity values to the [0,1] range.
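
The two file types described above could be read roughly as follows. This is a sketch under stated assumptions, not the repository's own code: it assumes the CSV files are comma-separated and purely numeric apart from # comment lines, that .gr files follow the PACE convention of `c` comment lines, a `p <descriptor> <n> <m>` header and 1-based `u v` edge lines, and it uses min-max scaling as one possible way to map values to [0,1] (the setup script's actual normalization may differ). All function names are hypothetical.

```python
import csv


def read_similarity_csv(lines):
    """Parse a numeric similarity matrix, skipping blank lines and '#' comments."""
    data = [ln for ln in lines if ln.strip() and not ln.lstrip().startswith("#")]
    return [[float(x) for x in row] for row in csv.reader(data)]


def min_max_normalize(matrix):
    """Rescale all entries to [0, 1] via min-max scaling (an assumed method)."""
    values = [v for row in matrix for v in row]
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant matrix has no spread; map everything to 0.0.
        return [[0.0 for _ in row] for row in matrix]
    return [[(v - lo) / (hi - lo) for v in row] for row in matrix]


def read_gr(lines):
    """Parse a PACE-style edge list; returns (vertex count, list of 1-based edges)."""
    n, edges = 0, []
    for ln in lines:
        parts = ln.split()
        if not parts or parts[0] == "c":  # blank line or comment
            continue
        if parts[0] == "p":  # header: p <descriptor> <n> <m>
            n = int(parts[2])
            continue
        edges.append((int(parts[0]), int(parts[1])))
    return n, edges


# Small in-memory examples instead of real files from data/.
weighted = "# toy similarity matrix\n0,2,4\n2,0,1\n4,1,0\n".splitlines()
unweighted = "c toy graph\np cep 3 2\n1 2\n2 3\n".splitlines()

print(min_max_normalize(read_similarity_csv(weighted)))
print(read_gr(unweighted))
```

For real instances, pass an open file object instead of the in-memory line lists, for example `read_gr(open(path))` with `path` pointing at a file in one of the data/ subfolders.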

Visualization:

`cd data && ./viz_all.sh`

This creates a PDF rendering of each instance in the same location as the instance file.

Dataset sources

Solvers

See the PACE 2021 report for a list of known solvers with public repositories, as well as the ranking.