Irishx's Stars
proycon/foliapy
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
proycon/foliatools
A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
LanguageMachines/frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
rfdj/SimpleNLG-NL
A Java-based surface realiser for Natural Language Generation in Dutch, based on SimpleNLG (https://github.com/simplenlg/simplenlg)
suzanv/termprofiling
Implementation of the term scoring algorithm in Tomokiyo & Hurst (2003), based on Kullback-Leibler Divergence (kldiv). Given a foreground and background corpus, it returns the most descriptive terms of the foreground corpus in the form of a termcloud
drelhaj/Java_WordCloud_LogLikelihood
Java tool to create word cloud by calculating frequencies and log Likelihood for a word between two large corpora
fkunneman/ADNEXT_collect
Repository with scripts for collecting data from the Web
proycon/LaMachine
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
cltk/cltk
The Classical Language Toolkit
fkunneman/ADNEXT
The project repository o