Tony-Wang/word2vec

Cluster output malformed

GoogleCodeExporter opened this issue · 0 comments

What steps will reproduce the problem?
1. Train on data with tokens that are numbers "12345" , "321" ...
2.  Select cluster output ( -classes option)

What is the expected output?

The expected cluster output is:

<token>, <cluster number>
ex:

quick, 44
357, 45

Instead, what you see is:

quick, 44
357 45,

What version of the product are you using? On what operating system?

0.1b on OSX 10.10.2

Please provide any additional information below.

it is currently 47oF and overcast outside


Original issue reported on code.google.com by craig.pf...@gmail.com on 8 Feb 2015 at 7:25