A general Python library to support the development of OWN-PT and OWN-EN.

Primary language: Python. License: MIT.

Py-OpenWordnet (Py-OWN)

This package contains a set of useful features for manipulating, representing, and releasing OpenWordnet-PT. Please make sure you have the current version of the code installed. Any suggestions are welcome!

Features

Py-OWN contains features for managing OWNs, as well as for conversion to the WN-LMF format. The package pyown.cli contains the following command-line interfaces: update, statistics, to_lmf and split.

Updating OWNs

To update OWNs, consider the following example:

$ python3 -m pyown.cli.update openWordnet-PT/data/own-pt-* --wns openWordnet-PT/dump/wn.jsonl --vts openWordnet-PT/dump/votes.jsonl --sgs openWordnet-PT/dump/suggestion-* -l pt -u arademaker vcvpaiva -o own-pt.nt -v

WN-LMF Format

The output follows WN-LMF-1.1.dtd and takes the ILI mapping into account. To convert the files, run:

$ python3 -m pyown.cli.to_lmf openWordnet-PT/data/own-pt-* path/to/ili-map.ttl -li own-pt -lb OpenWordnet-PT -vr 1.0 -lg pt -cs 1.0 --status checked -v

The process for English is similar; just change the options as needed. Please check the help message for details.

Thanks to the Global WordNet Association, John McCrae, and Francis Bond for the data, which is used under its licence.

Statistics

If needed, you can generate (or update) the statistics.org file by running:

$ python3 -m pyown.cli.statistics --ownpt openWordnet-PT/data/own-pt-* --ownen openWordnet-PT/data/own-en-* -v
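The command above relies on the shell to expand the own-pt-* and own-en-* patterns. Where that expansion is not available, the same argument list can be assembled from Python. This is only a minimal sketch: the helper name build_statistics_command is hypothetical and not part of Py-OWN, and it uses only the --ownpt, --ownen and -v options shown above.

```python
import glob
import shlex
import sys


def build_statistics_command(ownpt_pattern, ownen_pattern, verbose=True):
    """Build the argument list for pyown.cli.statistics (hypothetical helper).

    Glob patterns are expanded here instead of by the shell.
    """
    ownpt_files = sorted(glob.glob(ownpt_pattern))
    ownen_files = sorted(glob.glob(ownen_pattern))
    command = [
        sys.executable, "-m", "pyown.cli.statistics",
        "--ownpt", *ownpt_files,
        "--ownen", *ownen_files,
    ]
    if verbose:
        command.append("-v")
    return command


if __name__ == "__main__":
    command = build_statistics_command(
        "openWordnet-PT/data/own-pt-*",
        "openWordnet-PT/data/own-en-*",
    )
    # Print the command for inspection; pass the list to subprocess.run to execute it.
    print(shlex.join(command))
```

Building the command as a list, rather than as a single string, avoids quoting problems when file paths contain spaces.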

Logical Splitting

OpenWordnet-PT files are distributed split into logical pieces, such as own-**-synsets.ttl or own-**-relations.ttl. To split the files, run:

$ python3 -m pyown.cli.split openWordnet-PT/data/own-pt-* -l pt -e ttl -o data -v
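To check which logical pieces exist for each language after a split, the file names can be grouped by their parts. The sketch below assumes a naming scheme of own-<lang>-<piece>.<ext>, inferred from the examples above; the pattern and the function group_pieces are illustrative and not part of Py-OWN.

```python
import re
from collections import defaultdict
from pathlib import Path

# Assumed naming scheme, inferred from the examples: own-<lang>-<piece>.<ext>
PIECE_PATTERN = re.compile(r"^own-(?P<lang>[a-z]+)-(?P<piece>.+)\.(?P<ext>[a-z]+)$")


def group_pieces(paths):
    """Map each language code to the sorted list of its logical pieces."""
    groups = defaultdict(list)
    for path in paths:
        match = PIECE_PATTERN.match(Path(path).name)
        if match:  # files that do not follow the assumed scheme are skipped
            groups[match["lang"]].append(match["piece"])
    return {lang: sorted(pieces) for lang, pieces in groups.items()}


if __name__ == "__main__":
    data_dir = Path("data")  # the output directory passed with -o above
    print(group_pieces(data_dir.glob("own-*")))
```

Files that do not match the assumed pattern are ignored rather than reported, so an empty result may simply mean the naming scheme differs from this assumption.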

Development

You can install Py-OWN in developer (editable) mode by running:

$ pip install -e /path/to/pyown

We recommend installing it inside a Python virtual environment.