This repository holds code and artifacts of the extended abstract titled Leveraging Positional Encoding to Improve Fact Identification in Legal Documents accepted in the First International Workshop on Legal Information Retrieval.
For details about the models, check the paper in the workshop proceedings.
Files with a report
prefix holds the results of a specific model.
There is a running script for each model. For example, to run the BERT+PE(C) model we execute the run_BERT_PE_C.py
file. Running a model yields the respective report file. There is no command line parameters.
We are not sure if we are allowed to provide the dataset since it was derived from three other ones. Although, it is easy to craft the dataset: kept the Facts labels and replace the other ones with the Other label. The original datasets are available here, here and here. Check the application_*.py
files to discover the required path for the dataset.
The hyperparameters of a model can be set in the respective run script. In the following we describe such hyperparameters.
TODO: describe hyperparameters