Language-specific validation for Ancient Greek in UD
$ cat genesis.conllu | PYTHONPATH="/home/daniel/UD-GreekCheck:$PYTHONPATH" udapy -s .GreekCheck >/dev/null
[...]
2023-07-19 15:59:17,494 [WARNING] after_process_document - ud.MarkBugs Error Overview:
unsplit-crasis 2
finverb-mood 17
no-PronType 163
finverb-tense 167
no-NumType 277
no-Aspect 770
TOTAL 1396
This has the same interface as the udapi MarkBugs module, and the structure is largely copied from there.
This mostly skips checks performed by MarkBugs or the standard validation process in favor of things not checked elsewhere.
Check | Description |
---|---|
unsplit-crasis |
Crasis (identified by breathing mark after a consonant) should be split into MWTs (issue) |
non-crasis-mwt |
Non-crasis compounds, such as negative conjunctions, should not be split into MWTs (issue) |
no-[feat] |
Requires PronType for PRON|DET , NumType for NUM , and VerbForm and Aspect for VERB |
finverb-[feat] |
Requires VerbForm=Fin to be accompanied by VERB|AUX , Mood , and Tense |