lemmatizer

There are 97 repositories under the lemmatizer topic.

  • JohnSnowLabs/spark-nlp

    State of the Art Natural Language Processing

    Language:Scala3.7k100875704
  • JohnSnowLabs/nlu

    1 line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.

    Language:Python8242343128
  • BLKSerene/Wordless

    An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation

    Language:Python673292387
  • gutfeeling/word_forms

    Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate", etc.

    Language:Python608161470
  • CogComp/cogcomp-nlp

    CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

    Language:Java46963385144
  • nlpub/pymystem3

    A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary, and this library makes it easy to integrate into Python projects. Let us know in the issues if you would like to be involved in the development or maintenance of this project. If you have a fix or suggestion, please make a pull request. We are very open to accepting contributions.

    Language:Python288192444
  • Dadmatech/DadmaTools

    DadmaTools is a Persian NLP toolkit developed by Dadmatech Co.

    Language:Python16472635
  • adbar/simplemma

    Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

    Language:Python13075810
  • yohasebe/lemmatizer

    Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

    Language:Ruby1088815
  • sina-al/pynlp

    A pythonic wrapper for Stanford CoreNLP.

    Language:Python10781911
  • clipperhouse/jargon

    Tokenizers and lemmatizers for Go

    Language:Go103491
  • vhyza/elasticsearch-analysis-lemmagen

    Elasticsearch lemmatizer for 15 languages

    Language:Java10382425
  • explosion/spacy-experimental

    🧪 Cutting-edge experimental spaCy components and features

    Language:Python9511018
  • akoksal/Turkish-Lemmatizer

    Lemmatization for Turkish Language

    Language:Python897110
  • WZBSocialScienceCenter/germalemma

    A lemmatizer for German language text

    Language:Python8613411
  • allegro/elasticsearch-analysis-morfologik

    Morfologik Polish Lemmatizer plugin for Elasticsearch

    Language:Java8281924
  • aaaton/golem

    A lemmatizer implemented in Go

    Language:Go803920
  • Koziev/GrammarEngine

    Grammatical Dictionary of the Russian Language (+ English, Japanese, etc.)

    Language:C++7391819
  • sorenlind/lemmy

    🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

    Language:Python72409
  • mikahama/uralicNLP

    An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English

    Language:Python707177
  • sammous/spacy-lefff

    Custom French POS tagger and lemmatizer based on Lefff for spaCy

    Language:Python6331712
  • winkjs/wink-lemmatizer

    English lemmatizer

    Language:JavaScript635106
  • biblissima/collatinus

    Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion

    Language:JavaScript61104515
  • xiamx/lemma

    A Morphological Parser (Analyser) / Lemmatizer written in Elixir.

    Language:Elixir50203
  • Koziev/rulemma

    Lemmatizer for Russian-language texts

    Language:Python40346
  • 360er0/COMBO

    COMBO is a jointly trained tagger, lemmatizer and dependency parser.

    Language:Python36478
  • bastienbot/nlp-js-tools-french

    POS tagger, lemmatizer and stemmer for the French language in JavaScript

    Language:JavaScript36448
  • antixrist/node-phpmorphy

    A full-featured port of phpMorphy to Node.js

    Language:JavaScript33377
  • kuhumcst/cstlemma

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.

    Language:C++33576
  • sedthh/lara-hungarian-nlp

    NLP class for rapid chatbot development in the Hungarian language

    Language:Python29314
  • big-keva/libmorph

    libmorph rus/ukr - fast & accurate morphological analyzer/analyses for Russian and Ukrainian

    Language:HCL25615
  • alexeyev/mystem-scala

    Morphological analyzer `mystem` (Russian language) wrapper for JVM languages

    Language:Scala243116
  • banglakit/lemmatizer

    A rule-based lemmatizer for Bengali / Bangla, written in Python. Under active development.

    Language:Python22315
  • oeuvres/alix

    A Lucene Indexer for XML, with lexical analysis (lemmatization for French)

    Language:Java16484
  • bab2min/lamonpy

    Latin POS Tagger & Lemmatizer for Python

    Language:C++15241
  • jonfd/nefnir

    A lemmatizer for Icelandic text

    Language:Python15202