/underthesea

Underthesea - Vietnamese NLP Toolkit

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Underthesea - Vietnamese NLP Toolkit

image

image

image

image

Documentation Status

Updates

image

[English] [Tiếng Việt]

image

underthesea is a suite of open source Python modules, data sets and tutorials supporting research and development in Vietnamese Natural Language Processing.

Installation

To install underthesea, simply:

Satisfaction, guaranteed.

Usage

1. Word Segmentation

image

image

image

Usage

2. POS Tagging

image

image

image

Usage

3. Chunking

image

image

image

Usage

4. Named Entity Recognition

image

image

image

Usage

5. Text Classification

image

image

image

Install dependencies and download default model

Usage

6. Sentiment Analysis

image

image

image

Install dependencies

Usage

Up Coming Features

  • Text to Speech
  • Automatic Speech Recognition
  • Machine Translation
  • Dependency Parsing

Contributing

Do you want to contribute with underthesea development? Great! Please read more details at CONTRIBUTING.rst.