Problem with gensim

felipessalvatore opened this issue 7 years ago · 0 comments

After doing some tests, I got a strange result.

gensim performs better than the TensorFlow implementation on a Portuguese corpus ("g" stands for gensim and "tf" stands for TensorFlow; the number in each name is the size of the word embedding):

But after switching to an English corpus, the gensim model has a score close to 0:

I don't know why the gensim model performs so badly when the language changes. This is probably a bug in how gensim trains on the corpus; it would be great if someone could address this problem.