teesloane/trunk

Text splitting regex and word regex are swapped when adding a new language

Closed this issue · 3 comments

Description

I'm learning Norwegian, so I tried to add it as a new language. To do this, I just copied the regexps for Swedish, and everything seems fine on the settings page: a sample text is split correctly. However, when I add the same text through Create Text, the entire text is recognized as one single word.

EDIT

I figured out the problem: when adding a new language, whatever is entered in the "text splitting regex" box is saved as the word regex, and vice versa.

To reproduce

  1. Add Norwegian as a new language
  2. Create a new text
  3. Read the text

Expected behavior

Words are split correctly. The same text is parsed correctly when practicing Spanish.

> I figured out the problem: when adding a new language, whatever is entered in the "text splitting regex" box is saved as the word regex, and vice versa.

I'm not sure I understand this, but looking back at the copy I used, "text splitting regex" is maybe not the clearest label!
It sounds like your issue is resolved? If so, can I close this?

I'm sorry, I wasn't very clear.

This is when I add a language:
image

And this is how the language is saved:
image

As you can see, the entries get swapped.

Oh, gotcha. Thanks! I will look into that and fix that for the next release.
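
For anyone following along, the swap reported in this thread can be pictured with a small Python sketch. This is not trunk's code: the `Language` class, both `save_language_*` functions, the `regexes_round_trip` helper, and the two placeholder patterns are all hypothetical, made up only to show how two same-typed fields can trade places on save and how a round-trip check would catch it.

```python
from dataclasses import dataclass


@dataclass
class Language:
    # Hypothetical model; the field names mirror the labels used in this issue.
    name: str
    text_splitting_regex: str
    word_regex: str


def save_language_buggy(form: dict[str, str]) -> Language:
    # Positional arguments passed in the wrong order: the two regexes trade places.
    return Language(form["name"], form["word_regex"], form["text_splitting_regex"])


def save_language_fixed(form: dict[str, str]) -> Language:
    # Keyword arguments tie each form field to the attribute of the same name.
    return Language(
        name=form["name"],
        text_splitting_regex=form["text_splitting_regex"],
        word_regex=form["word_regex"],
    )


def regexes_round_trip(save, form: dict[str, str]) -> bool:
    # True only if both regexes come back in the field they were entered in.
    saved = save(form)
    return (
        saved.text_splitting_regex == form["text_splitting_regex"]
        and saved.word_regex == form["word_regex"]
    )


# Placeholder patterns (assumptions); they only need to differ from each other.
form = {
    "name": "Norwegian",
    "text_splitting_regex": r"\s+",
    "word_regex": r"[^\W\d_]+",
}

print(regexes_round_trip(save_language_buggy, form))  # False
print(regexes_round_trip(save_language_fixed, form))  # True
```

Because both fields are plain strings, nothing fails at save time when they are swapped; the problem only shows up later when a text is parsed. A round-trip check with two distinct patterns, like the one above, would surface it immediately.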