/katholou

A super-repository of treebanks originally based on a Perseus-like formlism and converted to UD format

Primary LanguagePython

Katholou UD Treebanks of Ancient Greek

Aims

Katholou collects a conversion of a series of treebanks of Ancient Greek into the Universal Dependency format.

The data are organized in subfolders, according to the source project they were converted from (see Treebanks, below).

The conversion was made with using my tb2ud routine. Please, report any issue using the issue tracker. If you spot any conversion problem, you can use the special ud conversion label to help me track them.

Treebanks

At the moment we have conversions from the following projects:

  • Daphne: my revised annotations on Sophocles (Aj., El., OT, Ant., Tr.), Aeschylus (Ag., Eu., PV).

  • Gorman Trees: Vanessa Gorman's extensive annotation on a lot of prose authors and texts [doi].

  • AGDT: the Greek authors and texts, minus the one already included in Daphne or Gorman Trees; this means: Il., Od, Hesiod, Hom. Hymn 2.

Coming soon:

  • Hesiod from the AGDT.

  • Pedalion: annotations in the context of the Pedalion project, by Alek Keersmaekers, Ton van Hal et al.

Name

Katholou (καθόλου), "on the whole, in general", is a word used especially by Aristotle to refer to the technical concept of "the universal", that which can be predicated of several individuals (Peters 1967: 100-1).

It seems like a good name for a project that:

  1. adopts the Universal Dependency formalism;
  2. aims to collect all the available Perseus-based treebanks.

Attribution

The conversion is the work of F. Mambrini, though (at least at the moment) no manual post-processing or editing was performed on the output of the conversion scripts.

For the original annotations of the souce projects, see the README file in each of the subdirectories for more information and for attribution to the orignal authors.

License

The data are distributed under a CC-BY-SA license.

See the linked websites of th different projects for the licenses of the original data.