A corpus of prototypical and spontaneous fallacies in Spanish.
Please refer to this paper for more details:
Cruz, F. L., Troyano, J. A., Enriquez, F., & Ortega, J. (2023). Detección y clasificación de falacias prototípicas y espontáneas en español. Procesamiento del Lenguaje Natural, 71.
This repository contains both the FallacyES corpus and a Jupyter notebook with the experiments reported in the paper.
The corpus is composed of two sections:
- Prototypical fallacies: examples of fallacies taken from educational materials. They have been translated and corrected from the "Logical Fallacy Dataset" (https://github.com/tmakesense/logical-fallacy/tree/main/dataset-fixed), which is in turn a corrected version of "LOGIC" (https://github.com/causalNLP/logical-fallacy). Examples of non-fallacies have also been added.
- Spontaneous fallacies: examples of fallacies taken from real user comments on a news aggregator website (reference). Examples of non-fallacies have also been collected from the same source.
Both sections are available in the dataset folder, which also includes a description of each column in the dataset/FallacyES.md file.
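As a rough sketch of how the corpus might be loaded, the snippet below assumes that each section is stored as a CSV file with a header row inside the dataset folder. That format is an assumption, since neither the file names nor the file format are stated here; check dataset/FallacyES.md for the actual layout and column names. The helper names `load_section` and `load_corpus` are hypothetical.

```python
import csv
from pathlib import Path


def load_section(csv_path):
    """Read one corpus section into a list of dicts keyed by column name.

    Assumes a UTF-8 CSV file with a header row (an assumption, not
    something the README confirms).
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def load_corpus(dataset_dir="dataset"):
    """Load every *.csv file found in dataset_dir, keyed by file stem."""
    return {
        path.stem: load_section(path)
        for path in sorted(Path(dataset_dir).glob("*.csv"))
    }


if __name__ == "__main__":
    # Print a short summary of whatever CSV files are present.
    for name, rows in load_corpus().items():
        columns = list(rows[0].keys()) if rows else []
        print(f"{name}: {len(rows)} rows, columns: {columns}")
```

Using only the standard library keeps the sketch free of extra dependencies. If the files turn out to use another format or delimiter, only `load_section` would need to change.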
Please cite this paper if you use this corpus:
Cruz, F. L., Troyano, J. A., Enriquez, F., & Ortega, J. (2023). Detección y clasificación de falacias prototípicas y espontáneas en español. Procesamiento del Lenguaje Natural, 71.
- Logical Fallacy Dataset (https://github.com/tmakesense/logical-fallacy/tree/main/dataset-fixed) and LOGIC (https://github.com/causalNLP/logical-fallacy), used as described in their licenses (open-source MIT license for non-commercial use).
- old.meneame.net contents, used as described in its license (https://creativecommons.org/licenses/by/3.0/).