kuhumcst/DanNet

word.csv doesn't contain all words

simongray opened this issue · 1 comments

Words that have been synthesized, e.g. https://wordnet.dk/dannet/data/word-s21059466, are not included in the words.csv file. The issue seems to be related to these synthesized words not having any lexinfo:partOfSpeech relations available (they only have wn:partOfSpeech. The query that fetches the data for these words expects the former relation, not the latter.

A temporary fix can be to use the latter relation as all words presumably have this relation. A more long-term fix might be to generate the missing relations for the words that lack them.

This issue seems to be related to adjectives only. Not only is lexinfo:partOfSpeech consistently missing, but the object part is both wn:adjective as well as lexinfo:adjective, rather than solely wn:adjective.

A complete fix should therefore include fixes for this in the full dataset, i.e. corrections for the existing wn:partOfSpeech as well as additions of lexinfo:partOfSpeech for every word missing this.