paolobenve/myphotoshare

Problems with non-occidental languages?

Closed this issue · 2 comments

MyPhotoShare works quite well with occidental languages, but I tried it with some media names in various oriental languages.

What I saw:

  • searching inside words for a character that appears in a media name returns nothing

There may be other problems as well.

Maybe the solution is to follow the hints in https://dmitripavlutin.com/what-every-javascript-developer-should-know-about-unicode/?

Surely one root of the problem is that the Python scanner generates the search album names by converting every Unicode character to its ASCII equivalent. To connect the search string with the results, we must therefore convert the search string the same way Python does; then we can read the corresponding search albums and match the media names.
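To illustrate the idea of applying the same conversion to both sides, here is a minimal sketch using only the Python standard library. The helper names `normalize_key` and `matches` are hypothetical, and the conversion shown (NFKD decomposition plus removal of combining marks) is an assumption: it is not necessarily what the scanner does, and it leaves characters without a decomposition, such as Chinese ideograms, unchanged rather than mapping them to ASCII.

```python
import unicodedata


def normalize_key(text: str) -> str:
    """Hypothetical helper: lowercase the text and strip diacritics.

    Assumption: the scanner's real conversion may differ; this only
    shows that both sides must go through the *same* function.
    """
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


def matches(search: str, media_name: str) -> bool:
    """Compare the search string and the media name after the same conversion."""
    return normalize_key(search) in normalize_key(media_name)
```

With this sketch, `matches("citta", "Città di Roma")` is true because both strings are reduced to the same form before comparison. If the search side skipped the conversion, an accented query would never match a name the scanner had already reduced.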

Added an explanation to the issues page: in languages like Chinese, probably every ideogram is a word (though this is not necessarily always the case), but treating each ideogram as a word would probably bloat the code, so the scanner keeps treating whitespace-separated entities as words.
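The difference between the two word definitions can be sketched as follows. Both function names are hypothetical and neither is taken from the scanner's code; the first mirrors the whitespace rule described above, the second is the per-ideogram alternative that was rejected.

```python
def words_by_whitespace(name: str) -> list[str]:
    """Whitespace rule: a word is any run of non-space characters."""
    return name.split()


def words_by_character(name: str) -> list[str]:
    """Rejected alternative: every non-space character is its own word."""
    return [ch for ch in name if not ch.isspace()]
```

For a name such as `"北京 旅行"`, the whitespace rule yields two words while the per-character rule yields four, so a search for a single ideogram only finds something under the second rule.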