mozilla/pontoon

Pontoon does not correctly differentiate between Turkish dotted and dotless "i"


Currently, Pontoon doesn't differentiate between the Turkish 'ı' and 'i' (capital I and İ respectively), despite these being different characters.

For example, these two queries produce the exact same results (in addition to incorrect highlighting):

https://pontoon.mozilla.org/tr/firefox/browser/browser/browser.ftl/?search=%C4%B1&string=246376
https://pontoon.mozilla.org/tr/firefox/browser/browser/browser.ftl/?search=i&string=246376
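One plausible way this kind of conflation can arise (an assumption on my part, I have not traced it through Pontoon's search code) is a case-insensitive comparison that uses locale-independent case mappings. A minimal Python sketch; `naive_ci_equal`, `turkish_upper` and `turkish_ci_equal` are hypothetical helpers for illustration, not Pontoon functions:

```python
# Sketch only: shows how locale-independent case mapping conflates
# Turkish dotless "ı" (U+0131) with "i". Helper names are hypothetical.

DOTLESS_I = "\u0131"      # ı, whose uppercase form is I
DOTTED_CAP_I = "\u0130"   # İ, whose lowercase form in Turkish is i


def naive_ci_equal(a: str, b: str) -> bool:
    """Case-insensitive comparison using the default Unicode upper()."""
    return a.upper() == b.upper()


def turkish_upper(s: str) -> str:
    """Uppercase with the Turkish rule i -> İ applied first."""
    return s.replace("i", DOTTED_CAP_I).upper()


def turkish_ci_equal(a: str, b: str) -> bool:
    """Case-insensitive comparison that keeps ı/I apart from i/İ."""
    return turkish_upper(a) == turkish_upper(b)


# Both "ı" and "i" uppercase to "I" by default, so they compare equal.
print(naive_ci_equal(DOTLESS_I, "i"))
# With the Turkish mapping, "ı" and "i" stay distinct.
print(turkish_ci_equal(DOTLESS_I, "i"))
```

Under the Turkish rule, 'ı' still matches 'I' and 'i' still matches 'İ', which is the pairing a Turkish localizer would expect from a case-insensitive search.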

This bug has been brought up and addressed before: https://bugzilla.mozilla.org/show_bug.cgi?id=1346180

After some research, database collation seems to be the correct and recommended way to handle this: http://www.i18nguy.com/unicode/turkish-i18n.html
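To illustrate what handling this at the collation level means, here is a rough sketch using Python's built-in `sqlite3` module as a stand-in for the real database. The `turkish_ci` collation, the `turkish_key` function and the `words` table are all hypothetical, and a production setup would more likely rely on a collation provided by the database itself:

```python
import sqlite3


def turkish_key(s: str) -> str:
    """Fold case with Turkish rules: I -> ı and İ -> i, then lower()."""
    return s.replace("I", "\u0131").replace("\u0130", "i").lower()


def turkish_ci(a: str, b: str) -> int:
    """Collation callback: negative, zero, or positive like a comparator."""
    ka, kb = turkish_key(a), turkish_key(b)
    return (ka > kb) - (ka < kb)


conn = sqlite3.connect(":memory:")
# Register the hypothetical collation under the name "turkish_ci".
conn.create_collation("turkish_ci", turkish_ci)
conn.execute("CREATE TABLE words (id INTEGER PRIMARY KEY, word TEXT)")
conn.executemany(
    "INSERT INTO words (word) VALUES (?)",
    [
        ("\u0131l\u0131k",),        # ılık
        ("ILIK",),
        ("ilik",),
        ("\u0130L\u0130K",),        # İLİK
    ],
)


def find(term: str) -> list:
    """Return words equal to `term` under the turkish_ci collation."""
    rows = conn.execute(
        "SELECT word FROM words WHERE word = ? COLLATE turkish_ci ORDER BY id",
        (term,),
    )
    return [row[0] for row in rows]


print(find("\u0131l\u0131k"))  # matches the dotless spellings only
print(find("ilik"))            # matches the dotted spellings only
```

The comparison logic lives in the collation rather than in each query, so a search for 'ı' and a search for 'i' return different rows without any special-casing in the calling code.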

However, even after reverting to our previous approach, I can confirm that Pontoon still doesn't detect the difference between the 'i' and 'ı' characters.

Collation in Django does seem to be supported, but the way we invoke entities.filter and entities.order_by makes me think we'd need a pretty large refactor to properly use Django's Collate function.