yao8839836/doc2vec_java

How to use?

Opened this issue · 1 comments

Hi.

Thanks for sharing. i understand you write the trained vectors into a file but to use the vector file?

And for the list of words in the training file (e.g. amazon doc) what should i put if i am generating my own sentences? for instance, your amazon shows "lowest setting level loud", it seem to a tokenize words from a sentence without the stop words.

Can give an example ?

@lchunleo

Hi.

The lines in the vector file are corresponding to your training file (with the same order), the vectors can be used in downstream tasks like text classification and text similarity computing.

Each line in the input training file is a document, It can be raw text with out preprocessing.