/RAGcorpus

The Ancient Greek corpus RAG (Reports in Ancient Greek, a sample from Thucydides’ History of the Peloponnesian War) is manually annotated with speech, attitude and perception reports.

RAG corpus

The Ancient Greek corpus RAG (a sample from Thucydides’ History of the Peloponnesian War) is manually annotated with speech, attitude and perception reports. Lemmas and Part-of-Speech tags in the corpus are not manually added but the output of the Ancient Greek lemmatizer and POS-tagger that we developed (see https://github.com/GreekPerspective/glem, also for the accuracy). The research is supported by the EU under FP7, ERC Starting Grant 338421-Perspective (see http://ncs.ruhosting.nl/perspective/).

We present the REPORTS annotation scheme in the paper at https://www.aclweb.org/anthology/W17-0806.pdf . A more detailed version including certain decisions that we made in the manual can be found below. We also provide an accompanying list of embedding entities and their (default) classification rules we used, as well as an explanation of the POS strings.

After the annotation of the corpus in BRAT we have ported the annotated corpus to ANNIS to carry out more complex search queries. The corpus can be found at https://applejack.science.ru.nl/annis-gui-3.4.4/ (select Thucydides from the corpus list). Here you can also find some example queries.

License

These materials are made available under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License.

Please refer to this publication when using this material:

Corien Bary, Leopold Hess, Kees Thijs, Peter Berck, and Iris Hendrickx. Annotating speech, attitude and perception reports. In Proceedings of the 11th Linguistic Annotation Workshop (LAW) at the EACL, pages 46–56, Valencia, Spain, 2017.