Kaljurand/owl-verbalizer

Can generate words that are morphologically impossible in ACE

Kaljurand opened this issue · 1 comments

Original issue 1 created by Kaljurand on 2010-06-03T15:35:03.000Z:

OWL verbalizer can generate e.g. "W.H.Bush likes Mary." given that the
ontology contains an individual "W.H.Bush" (or more precisely, if an
individual is annotated to have the surface form "W.H.Bush"). For ACE, this
sentence is actually a sequence of 3 sentences because dot is a sentence
end marker and cannot be contained in an ACE word.

OWL verbalizer should detect and report such cases.

Comment #1 originally posted by Kaljurand on 2011-06-12T10:37:24.000Z:

This is not really a bug. It's the users responsibility to provide ACE-compatible surface forms. In the csv-mode the content words are returned as IRIs, i.e. this issue does not even apply.

It would be nice though if there was a mode in which the OWL verbalizer finds unsuitable surface forms (and escapes them for the output), i.e. the issue is relabeled as "enhancement".