/doc2vec_java

A Java implementation of doc2vec in ICML'14

Primary LanguageJava

doc2vec_java

A Java implementation of doc2vec in ICML'14 based on https://github.com/NLPchina/Word2VEC_java

demo

src/test/Doc2VecTest.java

Require

Java 7 or above, I use Java 8 in this project. The input file should be in the form of file/amazon_docs.txt. One document per line.

Reference

  1. Le, Quoc V., and Tomas Mikolov. "Distributed representations of sentences and documents." ICML (2014).
  2. Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. "Distributed representations of words and phrases and their compositionality." In Advances in neural information processing systems, pp. 3111-3119. 2013.