URL persistence vs modularisation

Question

URL persistence vs modularisation

VladimirAlexiev opened this issue 9 years ago · 5 comments

Modularisation of NIF moved a bunch of elements from the nif: namespace to the nif-ann: namespace. See "deprecated" in https://github.com/NLP2RDF/ontologies/blob/nif2.1/nif-core/nif-core.ttl.

Modularization is all nice, but have you considered the tons (megatons!) of NIF data already in existence? Is the word persistence in http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core# merely a charade?

I think this one change will prevent us in Multisensor from using NIF 2.1. I think that an overriding prioriy in NIF 2.1 must be backward compatibility.

Note: backward compatibility only starts with URL permanence. Other issues to consider are the consequences of making sub-classes and sub-properties. Economy of representation (number of triples) is an important consideration to keep NLP as RDF a feasible idea, and NIF 2 thought carefully about that. Injudicious use of sub-classes and sub-properties will force NIF users to abandon RDFS... or NIF itself.

Answer 1 · 2016-03-23T16:37:56.000Z

Hi @VladimirAlexiev,
there was not so much renaming going on actually. Most of them are new properties and classes, so backwardscompatibility is not an issue. However, for the OLIA/pos tag/stem properties you are right. I talked to @neradis today and we came up with the following design:

move most of nif-ann: properties and classes to nif-core again (the extra ontology file unneccessarily complicates things)
create a module file for each NLP use case / layer:
https://github.com/NLP2RDF/ontologies/tree/nif2.1/nif-module
These are just drafts, I hope you get the main message. nif-pos.ttl is an exception in that it uses the nif-core namespace.
All other extensions, e.g. for sentiment should use a new namespace, e.g. nif-sent:opinion rdfs:range marl:Opinion

Do you agree with this?

Answer 2 · 2016-03-23T16:39:52.000Z

Other issues to consider are the consequences of making sub-classes and sub-properties. Economy of representation (number of triples) is an important consideration to keep NLP as RDF a feasible idea, and NIF 2 thought carefully about that. Injudicious use of sub-classes and sub-properties will force NIF users to abandon RDFS... or NIF itself.

This issue is solved as well with the above-mentioned fix right?

Answer 3 · 2016-03-24T05:46:34.000Z

Frankly, I'd prefer if you keep everything in a single namespace nif:

what is the chance of prop/class name clashes across the modules? I think it's small because they're all developed by a compact community (coordinated), and speak about different things
if you have multiple namespaces, users can easily be confused which one to use for a specific prop, unless they read it from a reference
you can't register nif-ann: or nif-sent: on prefix.cc as it only takes lowercase letters

Can we take this as acid test: do you have some examples of prop name clashes across the proposed new namespaces?

As a minimum, keep the old props where they were (i.e. move all old annotation props where they were).

Eg in Multisensor we've already used nif:opinion (though that's a small & unimportant part of our annotations, so we could switch to nif-sent:opinion if need be). (BTW, there's NLP2RDF/specification#1 about nif:opinion)

Sub-props and sub-classes is separate: #17

Answer 4 · 2016-03-24T06:18:54.000Z

ok, valid arguments. see the solution proposed for #17

Do you also agree to merge nifs:Annotation from https://github.com/NLP2RDF/ontologies/blob/master/nif-core/nif-stanbol.ttl#L78 into nif-core and rename it AnnotationUnit ?
We really need the feature to express basic alternative annotations.

Answer 5 · 2016-03-24T07:54:11.000Z

I've split this last question into #19.