language-processing
There are 255 repositories under the language-processing topic.
MarginaliaSearch/MarginaliaSearch
Internet search engine for text-oriented websites. Indexing the small, old and weird web.
pemistahl/lingua-go
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
pemistahl/lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
10up/classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence.
NirantK/NLP_Quickbook
NLP in Python with Deep Learning
knadh/dictpress
A stand-alone web server application for building and publishing full-fledged dictionary websites and APIs for any language.
MycroftAI/padatious
A neural network intent parser
gemengtju/Tutorial_Speech_Signal_Processing
This repo summarizes courses and materials for speech signal processing. Pull requests are welcome.
sefineh-ai/Amharic-Tokenizer
Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.
WZBSocialScienceCenter/germalemma
A lemmatizer for German language text
AdyTech99/volo
An F/OSS solution combining AI with Wikipedia knowledge via a RAG pipeline
ysenarath/sinling
A collection of NLP tools for Sinhalese (සිංහල).
srix/pytamil
The பைந்தமிழ் (pytamil) library is intended for the analysis of Tamil literary works. A wealth of knowledge is hidden in old literature; these texts are time machines to the past. Ever wondered what the popular color or food was in the Tamil-speaking world in 500 AD? The answer is hidden in the literature. With the right computer tools, it becomes possible to dig into this wealth of knowledge.
mako443/Text2Pos-CVPR2022
Code, dataset and models for our CVPR 2022 publication "Text2Pos"
M4t1ss/parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
TimKam/schreib-gut
German extension for write-good
imsanjoykb/German-Language-Learning-Resource
German Language Learning Resource
versotym/rhymetagger
Simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry.
ActiveNick/Unity-SpeechWithLUIS
Sample Unity project used to demonstrate the integration of Speech Recognition and Language Understanding using the new Microsoft Speech Service (Preview) and LUIS from Microsoft Cognitive Services.
mapado/pynlg
``pynlg`` is a pure Python re-implementation of [SimpleNLG-EnFr](https://github.com/rali-udem/SimpleNLG-EnFr), a Java library enabling bilingual [text surface realisation](https://en.wikipedia.org/wiki/Realization_%28linguistics%29), based on [SimpleNLG](https://github.com/simplenlg/simplenlg).
pigoz/lat
Tools to automate language acquisition through immersion. Includes sentence analysis (from books, subtitles) and Anki cards creation.
triatebr/aprenda-python
Learning notes, tips, and projects about Python
Near32/ReferentialGym
This framework provides out-of-the-box implementations of Referential Games variants in order to study the emergence of artificial languages using deep learning, relying on PyTorch (https://www.pytorch.org).
RMNCLDYO/groq-ai-toolkit
A lightweight Python API wrapper and CLI for Groq’s language models, which run on their ultra-fast LPU Inference Engine.
searchpioneer/lingua-dotnet
Natural language detection library for .NET, suitable for long and short text alike
mujeebishaque/language-detector
This software detects the language of websites. It goes over a provided list of URLs and saves each URL and its language in an Excel sheet.
martinferianc/C90Compiler-EIE2
C90-to-MIPS I compiler written as coursework for EE2-15
ishto7/persianutils
Standardize your Persian text: Preprocessing, Embedding, and more!
vignif/lex-yacc-SQL-parser
A simple parser for standard SQL, developed using Lex and Yacc. The project was made for Language Processing Technologies @diism, University of Siena. Feel free to use it for academic purposes.
verifid/ner-d
Python module for Named Entity Recognition (NER) using natural language processing.
lexected/astir
A flexible parser generator producing output from object-oriented hierarchical context-free grammar specifications.
melchisedech333/antlr4-experiments
My studies on context-free grammars, using ANTLR4 (C++) to generate the parser files. Some basics are covered, such as token processing, recursion, variable definition, array processing, Abstract Syntax Tree (AST) manipulation, Unicode support, and error handling.
RacimRgh/Dictionnaire-medical-Python-Unitex
A Python scraper that generates a medical dictionary from vidal.fr, then enhances it using Unitex/Gramlab
shamspias/google-meet-translator-extension
Google Meet Transcript Translator is a Chrome extension that translates live transcriptions during a Google Meet call into your chosen language. Enhance your global communication.