hipe-eval/HIPE-2022-data

Annotation Issue (NewsEye)

Opened this issue · 0 comments

Encountered issue while NEL data processing in file HIPE-2022-v2.1-newseye-dev-de.tsv:

lines 32928-32931

Haa¬	B-PER	_	O	_	_	B-ORG	Q1405350	_	NoSpaceAfter|EndOfLine
senstein	I-PER	_	O	_	_	I-ORG	Q56322697	_	_
&	O	_	O	_	_	I-ORG	Q56322697	_	_
Vogler	O	_	O	_	_	I-ORG	Q56322697	_	_

The Qid covers the correct entity of type ORG. "Haasenstein & Vogler" is considered to be nested entity, that has a person type included. From the definition of nested entities, the smaller entity should be the one nested, thus:

Haa¬	B-ORG	_	O	_	_	B-PER	Q1405350	_	NoSpaceAfter|EndOfLine
senstein	I-ORG	_	O	_	_	I-PER	Q56322697	_	_
&	I-ORG	_	O	_	_	O	Q56322697	_	_
Vogler	I-ORG	_	O	_	_	O	Q56322697	_	_

Later on, in the same file (lines 33541-33543), the correct annotation is used:

Haasenstein	B-ORG	_	O	_	_	B-PER	Q56322697	_	_
&	I-ORG	_	O	_	_	O	Q56322697	_	_
Vogler	I-ORG	_	O	_	_	O	Q56322697	_	_