GrammaticalFramework/gf-wordnet

Somali orthography

inariksit opened this issue · 3 comments

Great to have Somali coverage in GF!

I'm curious to know where the data is from. It looks rather linguistic, with lots of IPA characters.

lin question_1_V2 = mkV2 (mkV "suʔal") noPrep ; --guessed
lin put_6_V2 = mkV2 (mkV "ɖɪg") noPrep ; --guessed
lin squeeze_1_V2 = mkV2 (mkV "tuːǯi") noPrep ; --guessed

The sources I've read that try to adhere to Somali conventions (like this one), would use ' for the glottal stop, and dh for the retroflex. The word squeeze would be written tuuji. There are many other examples as well, and I'm not quite sure about all of them. So I'd like to know the source, so I can make a more educated guess.

I would like the GF resource to reflect more the prevalent usage than grammar books. If you agree, I can take care of the transliteration.

Smart paradigms for verbs

This is a TODO item for myself.

When I wrote the fragments of the Somali RG 2 years ago, I left my verb morphology quite unfinished. I was trying to decide which form would be the dictionary form, and found different conventions in different lexica and grammars. I didn't come to a conclusion, so my current implementation takes an artificial stem, which is not any actual wordform. I can adjust my smart paradigms to work with different forms, but that's good to keep in mind if we get words from different sources: their dictionary form may not be the same.

Okay, I understand! So it's likely that already the words in there currently have different base forms. Unfortunately I don't know Somali in any meaningful way, just random anecdotes about morphosyntax. :-P But I noticed that the word "ulayn" appears as a translation for 890 different words, which seem to be pretty arbitrary (here's a party game: what's in common with accept, chop, paste, seduce, trash… and 800+ more words?)

I think I'll contact someone who actually knows Somali, and try to organise a meeting to go through the lexicon. I'll get back to you after I have some more information.