ryancotterell/sigmorphon2016

Issues in Turkish data

ftyers opened this issue · 0 comments

The following examples in the Turkish data look like errors:

kesermızda|||keser|||pos=N|poss=PSS1P|case=ESS|num=SG
keserlarını|||keser|||pos=N|poss=PSS3P|case=ACC|num=PL
keserlarımızı|||keser|||pos=N|poss=PSS1P|case=ACC|num=PL
keser|||keser|||pos=N|case=NOM|num=SG
keserler|||keser|||pos=N|num=PL
kesermdan|||keser|||pos=N|poss=PSS1S|case=ABL|num=SG
keserlarıma|||keser|||pos=N|poss=PSS1S|case=DAT|num=PL
kesermın|||keser|||pos=N|poss=PSS1S|case=GEN|num=SG
kesernızı|||keser|||pos=N|poss=PSS2P|case=ACC|num=SG
keserlarının|||keser|||pos=N|poss=PSS3P|case=GEN|num=PL
keserlarının|||keser|||pos=N|poss=PSS3S|case=GEN|num=PL

They should probably be:

keserimizde|||keser|||pos=N|poss=PSS1P|case=ESS|num=SG
keserlerini|||keser|||pos=N|poss=PSS3P|case=ACC|num=PL
keserlerimizi|||keser|||pos=N|poss=PSS1P|case=ACC|num=PL
keser|||keser|||pos=N|case=NOM|num=SG
keserler|||keser|||pos=N|num=PL
keserimden|||keser|||pos=N|poss=PSS1S|case=ABL|num=SG
keserlerime|||keser|||pos=N|poss=PSS1S|case=DAT|num=PL
keserimin|||keser|||pos=N|poss=PSS1S|case=GEN|num=SG
keserinizi|||keser|||pos=N|poss=PSS2P|case=ACC|num=SG
keserlerinin|||keser|||pos=N|poss=PSS3P|case=GEN|num=PL
keserlerinin|||keser|||pos=N|poss=PSS3S|case=GEN|num=PL

There is also:

keserymışız|||keser|||pos=N|tense=PST|per=1|num=SG|evid=NFH

I'm not sure what to do with that one... the vowel harmony is off, and it looks like: keser>(y)mIş>Iz "presumably we were a hammer", but I'm not sure if you need the -y- or not.

Also, case=ESS should probably be case=LOC, and is there any reason for choosing the order of features ? It doesn't follow the morpheme order (pos, num, poss, case) or alphabetical order... (pos, case, num, poss).