Repository of the Demosthenes corpus, a novel corpus for argument mining in legal documents composed of 40 decisions of the Court of Justice of the European Union on matters of fiscal state aid. The corpus is annotated at three hierarchical levels: the argumentative elements, their types, and their argument schemes.
If you use this repository, dataset, or code, please cite our work as:
Piera Santin, Giulia Grundler, Andrea Galassi, Federico Galli, Francesca Lagioia, Elena Palmieri, Federico Ruggeri, Giovanni Sartor, Paolo Torroni. 2023. Argumentation Structure Prediction in CJEU Decisions on Fiscal State Aid. In Proceedings of 19th International Conference on Artificial Intelligence and Law (ICAIL), to be published, Braga, Portugal.
Giulia Grundler, Piera Santin, Andrea Galassi, Federico Galli, Francesco Godano, Francesca Lagioia, Elena Palmieri, Federico Ruggeri, Giovanni Sartor, and Paolo Torroni. 2022. Detecting Arguments in CJEU Decisions on Fiscal State Aid. In Proceedings of the 9th Workshop on Argument Mining, pages 143–157, Online and in Gyeongju, Republic of Korea. International Conference on Computational Linguistics. URL: https://aclanthology.org/2022.argmining-1.14.
@inproceedings{santin-etal-2023-argumentation,
title = "Argumentation Structure Prediction in CJEU Decisions on Fiscal State Aid",
author = "Santin, Piera and
Grundler, Giulia and
Galassi, Andrea and
Galli, Federico and
Lagioia, Francesca and
Palmieri, Elena and
Ruggeri, Federico and
Sartor, Giovanni and
Torroni, Paolo",
booktitle = "ICAIL '23: 19th International Conference on Artificial Intelligence and Law",
year = "2023",
address = "Braga, Portugal",
publisher = "ACM",
}
@inproceedings{grundler-etal-2022-detecting,
title = "Detecting Arguments in {CJEU} Decisions on Fiscal State Aid",
author = "Grundler, Giulia and
Santin, Piera and
Galassi, Andrea and
Galli, Federico and
Godano, Francesco and
Lagioia, Francesca and
Palmieri, Elena and
Ruggeri, Federico and
Sartor, Giovanni and
Torroni, Paolo",
booktitle = "Proceedings of the 9th Workshop on Argument Mining",
month = oct,
year = "2022",
address = "Online and in Gyeongju, Republic of Korea",
publisher = "International Conference on Computational Linguistics",
url = "https://aclanthology.org/2022.argmining-1.14",
pages = "143--157",
}
- the demosthenes_dataset folder contains the tagged documents in xml format
- xmlToJson.py is a python script that converts the dataset into json format
- create_df.py is a python script that generates two dataframes: the first one contains the annotated sentences with their attributes, while the second contains all the documents' sentences and it is used for the AD task
- argumentmining.py defines the functions that perform the classification tasks
- run_experiments.py calls the argumentmining.py functions with the desired parameters
- run xmlToJson.py to convert the xml dataset into the required json format
- run create_df.py to create the dataframes
- open run_experiments.py to choose the tasks, embeddings and classifiers, or run it as it is to get the complete set of experiments