Stress on multi-word expressions
reynoldsnlp opened this issue · 1 comments
Taken from reynoldsnlp/udar#19
The lexical underlying form needs to have a persistent stress mark that survives the two-level rule that reduces stresses to the right-most one. For example,...
красно-жёлтых
так как
так что
то есть
Search through an fst2strings version of a stressed transducer for any words with stresses on both sides of spaces and hyphens. Something like this: egrep ":.*[ё́̀].*(% |-).*[ё́̀]"
See giellatekno subversion revision 181941. Most of these have been fixed by changing preceding stress marks that should persist into secondary stress marks ('\u0300').
Those like красно-жёлтых came from the RNC addendum to Zalizniak, so they don't have stress marked at all. They already had a TODO comment to work with a native to place stress. I added a comment that this should be done using secondary stress marks.