/sentence2vectors

Segment a Mandarin sentence into words, and embedding these words into vectors.

Primary LanguagePythonMIT LicenseMIT

sentence2vectors

Segment a Mandarin sentence into words, and embedding these words into vectors.

Mandarin word segmentation

Segment Mandarin sentences into words based on jieba library.

Mandarin word embedding

Embedding Mandarin words into vectors by gensim library.

Visualize embedding results

Show embedding results by tensorboard library.

Requirement

  • numpy>=1.14.5
  • jieba>=0.39
  • gensim>=3.5.0
  • tensorflow>=1.8.0 or tensorflow-gpu>=1.8.0

You can use the following command to initialize your python environment:

pip install -r requirements.txt