Use the Dockerfile to build an image & run a container on vs code.
Python version must be 3.7, numpy==1.19.5, tensorflow==2.2.0, and must install spot
library for evaluation.
Also, add the following so that Python can find where the spot library is during the runtime.
echo 'export LD_LIBRARY_PATH=/usr/local/lib' >> ~/.bashrc
This repository is the official implementation of the ICLR'21 paper Teaching Temporal Logics to Neural Networks.
We trained a Transformer to predict solutions to linear-time temporal logical (LTL) formulas. Our trained models evince powerful generalization properties, namely, the generalization to the semantics of the logic, and the generalization to larger formulas than seen during training. This repository includes 1) a data generation framework for teaching reasoning tasks on propositional logic and LTL to a Transformer and 2) a training framework with our experimental results, where our models predict solutions to propositional and LTL formulas.
Contribution. Generally, formulas have many solutions and the training data thus depends on the particularities of the generator. While the Transformer does not perfectly predict the generator’s output, it still produces correct solutions to almost all formulas, even when its prediction deviates from the generator. With this learning framework, it seems to be easier to learn the semantics of the logics than the particularities of the generator.
The Transformer preserves this semantic generalization even when challenged with formulas of a size it has never encountered before (see following picture): Performance of our best model (only trained on LTL formulas up to length 35) on LTL formulas up to length 50 with a tree positional encoding. Exact syntactic matches are displayed in green, the semantic accuracy in light green and the incorrect predictions in orange. The shaded area indicates the formula sizes the model was not trained on. For more details see Teaching Temporal Logics to Neural Networks.
Surprisingly, the Transformer solves almost all LTL formulas in our test set including those for which our generator timed out. For example, very complex formulas such as (LTL formula above, predicted satisfying trace below):
To install the python package, run:
pip install .
In addition, the LTL, automata manipulation and model checking library Spot is required. Please follow the download and installation instructions on their website.
To train a linear-time temporal logic model with dataset ltl-35 for 5 epochs with default parameters, run:
python -m deepltl.train.train_transformer --problem='ltl' --ds-name='ltl-35' --epochs=5
To train a propositional logic model with dataset prop-35 for 5 epochs with default parameters, run:
python -m deepltl.train.train_transformer --problem='prop' --ds-name='prop-35' --epochs=5
Below the available datasets and parameters are described.
To evaluate a linear-time temporal logic model on dataset ltl-50-test with default parameters, run:
python -m deepltl.train.train_transformer --problem='ltl' --ds-name='ltl-50-test' --test
To evaluate a propositional logic model on dataset prop-50-test with default parameters, run:
python -m deepltl.train.train_transformer --problem='prop' --ds-name='prop-50-test' --test
- ltl-35 1 million LTL formulas of maximum size 35 with 5 different propositons split into a training set of 800k formulas, a validation set of 100k formulas and a test set of 100k formulas
- ltl-50-test a test set of 20k LTL formulas of size 35 to 50 with 5 different propositons
- prop-35 1 million propositional logic formulas of maximum size 35 with 5 differnt propositons split into a training set of 800k formulas, a validation set of 100k formulas and a test set of 100k formulas
- prop-50-test a test set of 20k propositional logic formulas of size 35 to 50 with 5 different propositons
- prop-60-no-derived 1 million propositional logic formulas of maximum size 60 with 10 different propositions and without derived operators (exclusive or, equivalence) split into a training set of 800k formulas, a validation set of 100k formulas and a test set of 100k formulas
The following parameter can be specified when training the models:
command line argument | default | parameter |
---|---|---|
problem | ltl | problem (either ltl or prop) |
ds-name | ltl-35 | dataset name |
run-name | default | name of the training / testing run |
d-embed-enc | 128 | embedding dimension encoder |
d-embed-dec | 128 | embedding dimension decoder |
num-layers | 4 | number of encoder/decoder layers |
num-heads | 4 | number of attention heads |
d-ff | 512 | dimension of fully-connected feed-forward networks |
ff-activation | relu | activation function of fully-connected feed-forward networks |
dropout | 0.1 | amount of dropped out units |
warmup-steps | 4000 | number of warmup steps |
tree-pos-enc | False | whether to use the tree positional encoding |
batch-size | 100 | batch size |
epochs | 3 | number of epochs |
alpha | 1.0 | beam search parameter for length normalization |
beam-size | 2 | beam size |
The following table shows our best performing models and the corresponing parameters:
Prop | LTL | |
---|---|---|
ds-name | prop-35 | ltl-35 |
d-embed-enc | 128 | 128 |
d-embed-dec | 64 | 128 |
num-layers | 6 | 8 |
num-heads | 6 | 8 |
d-ff | 512 | 1024 |
dropout | 0.1 | 0.1 |
batch-size | 1024 | 768 |
epochs | 64 | 48 |
alpha | 1.0 | 1.0 |
beam-size | 3 | 3 |
exact match | 58.1% | 83.8% |
correct | 96.5% | 98.5% |
The ltl-35 dataset was created by calling
python -m deepltl.data.generator --num-aps 5 --num-formulas 1000000 --tree-size 35 --timeout 120 --alpha 0.112
where alpha
is used to tune the distribution of formulas, so that it is actually uniform in size.
For more details on available parameters, consider the help (-h
).
To generate a propositional logic dataset, the following additional package is required:
py-aiger-cnf
: simple_aig2cnf branch from https://github.com/MarkusRabe/py-aiger-cnf/tree/simple_aig2cnf
The prop-35 dataset was created by calling
python -m deepltl.data.sat_generator --num-aps 5 --num-examples 1000000 --max-size 35 --alpha 0.095 --target-directory data_prop
Different node distributions and minimal sizes have to be set manually in the code.
@inproceedings{deepltl,
author = {Christopher Hahn and Frederik Schmitt and Jens U. Kreber and Markus Norman Rabe and Bernd Finkbeiner},
title = {Teaching Temporal Logics to Neural Networks},
booktitle = {9th International Conference on Learning Representations, {ICLR} 2021, Virtual Event, Austria, May 3-7, 2021},
publisher = {OpenReview.net},
year = {2021},
url = {https://openreview.net/forum?id=dOcQK-f4byz}
}