property synsetId
fredsonaguiar opened this issue · 1 comments
fredsonaguiar commented
The property synsetId has been used to describe the synset_offset from https://wordnet.princeton.edu/documentation/wndb5wn, which isn't unique among synsets of different types (precisely, of different POS). We should move this information to wn30:offset and instantiate the unique wn30:synsetId as described.
For instance, the triple
<https://w3id.org/own-pt/wn30-en/instances/synset-02734835-n> wn30:synsetId "02734835" .
should be expanded into
<https://w3id.org/own-pt/wn30-en/instances/synset-02734835-n> wn30:synsetId "02734835-n" .
<https://w3id.org/own-pt/wn30-en/instances/synset-02734835-n> wn30:offset "02734835" .
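The expansion above can be sketched in plain Python. This is only an illustration, not the pyownpt script: the function name expand_synset_id, the representation of triples as tuples of strings, and the assumption that every subject IRI ends in "synset-<8-digit offset>-<POS letter>" are all hypothetical.

```python
import re

SYNSET_ID = "wn30:synsetId"
OFFSET = "wn30:offset"

# Assumption: subject IRIs end in "synset-<8-digit offset>-<POS letter>".
_SUBJECT_RE = re.compile(r"synset-(\d{8})-([a-z])$")


def expand_synset_id(triples):
    """Rewrite bare-offset synsetId triples into synsetId + offset triples."""
    out = []
    for s, p, o in triples:
        m = _SUBJECT_RE.search(s)
        # Only touch synsetId triples whose value is still the bare offset.
        if p == SYNSET_ID and m and o == m.group(1):
            out.append((s, SYNSET_ID, f"{o}-{m.group(2)}"))  # unique id
            out.append((s, OFFSET, o))                       # plain offset
        else:
            out.append((s, p, o))
    return out


subject = "https://w3id.org/own-pt/wn30-en/instances/synset-02734835-n"
for triple in expand_synset_id([(subject, SYNSET_ID, "02734835")]):
    print(triple)
```

Because the rewrite only fires when the value equals the bare offset found in the IRI, applying it a second time to its own output leaves the triples unchanged.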
FredsoNerd commented
In 2d7e15c we applied this script and got the following output:
python3 pyownpt/cli/property_synset_id.py openWordnet-PT/own-files/own-en-synsets.ttl -o own-en-synsets.ttl -v
INFO:root:loading data from file 'openWordnet-PT/own-files/own-en-synsets.ttl'
INFO:ownpt:start formatting property synsetId
INFO:ownpt:action applied to 117659 cases
total: 235318 triples added
total: 117659 triples removed
INFO:root:serializing output to 'own-en-synsets.ttl'
python3 pyownpt/cli/property_synset_id.py openWordnet-PT/own-files/own-pt-synsets.ttl -o own-pt-synnsets.ttl -v
INFO:root:loading data from file 'openWordnet-PT/own-files/own-pt-synsets.ttl'
INFO:ownpt:start formatting property synsetId
INFO:ownpt:action applied to 117659 cases
total: 235318 triples added
total: 117659 triples removed
INFO:root:serializing output to 'own-pt-synnsets.ttl'
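The totals in both runs are consistent with the proposed expansion, where each affected synset loses one triple and gains two. A quick check of that arithmetic, using only the numbers from the logs (the variable names are illustrative):

```python
cases = 117659          # "action applied to 117659 cases"
added_per_case = 2      # new wn30:synsetId + new wn30:offset
removed_per_case = 1    # old wn30:synsetId holding the bare offset

assert cases * added_per_case == 235318    # "triples added"
assert cases * removed_per_case == 117659  # "triples removed"

# Net change in the size of each file, in triples.
net_growth = cases * (added_per_case - removed_per_case)
print(net_growth)
```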