language-processing

There are 215 repositories under language-processing topic.

  • lingua-go

    pemistahl/lingua-go

    The most accurate natural language detection library for Go, suitable for short text and mixed-language text

    Language:Go1.1k113464
  • MarginaliaSearch/MarginaliaSearch

    Internet search engine for text-oriented websites. Indexing the small, old and weird web.

    Language:HTML89075623
  • lingua-rs

    pemistahl/lingua-rs

    The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

    Language:Rust84485335
  • lingua

    pemistahl/lingua

    The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

    Language:Kotlin6621112760
  • NirantK/NLP_Quickbook

    NLP in Python with Deep Learning

    Language:Jupyter Notebook563322231
  • classifai

    10up/classifai

    Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence.

    Language:PHP5528336352
  • knadh/dictpress

    A stand-alone web server application for building and publishing full fledged dictionary websites and APIs for any language.

    Language:Go350141739
  • monkeylearn/monkeylearn-python

    Official Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.

    Language:Python16625744
  • MycroftAI/padatious

    A neural network intent parser

    Language:Python158191942
  • WZBSocialScienceCenter/germalemma

    A lemmatizer for German language text

    Language:Python8613411
  • gemengtju/Tutorial_Speech_Signal_Processing

    This repo summarizes the courses and materials for speech signal processing. You are kindly invited to pull requests.

  • monkeylearn/monkeylearn-ruby

    Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.

    Language:Ruby80151014
  • monkeylearn/monkeylearn-php

    Official PHP client for the MonkeyLearn API. Build and consume machine learning models for language processing from your PHP apps.

    Language:PHP501417
  • pytamil

    srix/pytamil

    பைந்தமிழ் (pytamil) library is intended to be used in analysis of tamil literary work. A wealth of knowledge is hidden in old literature. They are time machines to past. Ever wondered what is the popular color or food in tamil speaking world in 500AD. The answer is hidden in literature. With right computer tools it becomes possible for us to dig in to this wealth of knowledge.

    Language:Python48939
  • ysenarath/sinling

    A collection of NLP tools for Sinhalese (සිංහල).

    Language:Jupyter Notebook477616
  • monkeylearn/monkeylearn-node

    Official Node client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Node apps.

    Language:JavaScript4412418
  • parallel-corpora-tools

    M4t1ss/parallel-corpora-tools

    Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.

    Language:PHP405516
  • kariminf/aruudy

    Arabic prosody (Arud) or "Science of Poetry"

    Language:Python383197
  • mako443/Text2Pos-CVPR2022

    Code, dataset and models for our CVPR 2022 publication "Text2Pos"

    Language:Python37397
  • TimKam/schreib-gut

    German extension for write-good

    Language:JavaScript37701
  • mapado/pynlg

    ``pynlg`` is a pure python re-implementation of [SimpleNLG-EnFr](https://github.com/rali-udem/SimpleNLG-EnFr), a java library enabling bilingual [text surface realisation](https://en.wikipedia.org/wiki/Realization_%28linguistics%29), based on [SimpleNLG](https://github.com/simplenlg/simplenlg).

    Language:Python2914710
  • ActiveNick/Unity-SpeechWithLUIS

    Sample Unity project used to demonstrate the integration of Speech Recognition and Language Understanding using the new Microsoft Speech Service (Preview) and LUIS from Microsoft Cognitive Services.

    Language:C#28605
  • versotym/rhymetagger

    A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry

    Language:Python27424
  • monkeylearn/monkeylearn-java

    Official Java client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Java apps.

    Language:Java241345
  • pigoz/lat

    Tools to automate language acquisition through immersion. Includes sentence analysis (from books, subtitles) and Anki cards creation.

    Language:Ruby232160
  • triatebr/aprenda-python

    Aprendizado, dicas e projetos sobre Python

    Language:Jupyter Notebook23207
  • Near32/ReferentialGym

    This framework provides out-of-the-box implementations of Referential Games variants in order to study the emergence of artificial languages using deep learning, relying on PyTorch (https://www.pytorch.org).

    Language:Python18403
  • imsanjoykb/German-Language-Learning-Resource

    German Language Learning Resource

  • martinferianc/C90Compiler-EIE2

    C90 to MIPS I Compiler done as a coursework for EE2-15

    Language:C++16202
  • FORMAS/DptOIE

    Language:Java14616
  • verifid/ner-d

    Python module for Named Entity Recognition (NER) using natural language processing.

    Language:Python14413
  • ishto7/persianutils

    Standardize your Persian text: Preprocessing, Embedding, and more!

    Language:Python13501
  • mujeebishaque/language-detector

    this software detects the language of the website. It goes over list of url provided and saves the url + language in an excel sheet

    Language:Python13201
  • vignif/lex-yacc-SQL-parser

    Simple parser for SQL standard language, this tool is developed using Lex and Yacc, project made for Language Processing Technologies @diism University of Siena. feel free to use it for academic purposes

    Language:C13203
  • lexected/astir

    A flexible parser generator producing output from object-oriented hierarchical context-free grammar specifications.

    Language:C++11230
  • RacimRgh/Dictionnaire-medical-Python-Unitex

    A python scraper that generates a medical dictionnary from vidal.fr, then enhance it using Unitex/Gramlab

    Language:Python11102