LIAAD/yake

Will Yake support asian languages such Korean or Japanese?

sysmetic opened this issue · 3 comments

I was really inspired by YAKE framework with tremendous usefulness.

Hi @sysmetic
In theory yes, but we can not promise the same performance for logographic languages as in phologic ones.

However the only thing you need to do is provide the stopwords list for that language.

Check the answer for a similar question here #40

Hi @sysmetic
In theory yes, but we can not promise the same performance for logographic langauges as in phologic ones.

However the only thing you need to do is provide the stopwords list for that language.

Check the answer for a similar question here #40

As far as I've known, Korean doesn't have GENERAL stopwords list because it belongs in "agglutinative" languages as well as Japanese. So, to get stop words generally is to use several morph analyzers(https://konlpy.org/en/v0.4.3/morph/) and its results of analyzers are need to filter stop word by the information of pos tagging, which are seems to be stopwords in contexts. If I do available, I can provide you pos-tagger list of Korean aka stopwords

Thank you.

Hi @sysmetic. Let me know if you managed to make it work for Korean.