/chana-library

Python 3.x library with an NLP toolkit for Shipibo-Konibo

Primary LanguagePythonMIT LicenseMIT

Chana: An NLP toolkit for the Shipibo-Konibo language of Peru.

chana is a Python library of various NLP tools for the Shipibo-Konibo. Some of these tools can be reused on other peruvian indigenous and/or highly agglutinative languages. It is built on top of scikit-learn, python-crfsuite and distributed under MIT license.

Chana has various NLP tools such as:

  • Lemmatizer.
  • Part-of-Speech tagger.
  • Named Entity annotation.
  • Syllabificator.

Installation

Dependencies

Chana requires:

  • Python (>= 3.4)
  • NumPy (>= 1.13.1)
  • Scikit-learn (>= 0.18.1)
  • Python-crfsuite (>= 0.9.5)

User installation

If you already have a working installation of numpy, scikit-learn and python-crfsuite, the easiest way to install chana is using pip :

pip install chana

Help and Support

Important links

Communication

Contact

For any question and feedback please contact:

Acknowledgments

  • Pontificia Universidad Católica del Perú (PUCP)
  • Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica (CONCYTEC)
  • NVIDIA
  • Amazon Web Services