valeq's Stars
omwn/omw-data
This packages up data for the Open Multilingual Wordnet
ontolex/frequency-attestation-corpus-information
OntoLex module for Frequency, Attestations and Corpus Information (draft)
CoPhi/CLARIN-learning-resources-for-H2IOSC
H2IOSC Learning Resources
latexdraw/latexdraw
A vector drawing editor for LaTeX (JavaFX).
clarin-eric/ParlaMint
ParlaMint: Comparable Parliamentary Corpora
jmccrae/wordnet-angular
Princeton WordNet Interface based on Angular.js and Rust
globalwordnet/gwadoc
Documentation for things like relations and parts of speech used by wordnets
globalwordnet/OMW
The Open Multilingual Wordnet
clarin-eric/resource-families-issues
ontolex/lexinfo
LexInfo - Data Category Ontology for OntoLex-Lemon
clld/wold2
The World Loanword Database
Jmuccigr/temples
Stuff on temples of the Classical world (Greek and Roman and Etruscan, etc)
doccano/doccano
Open source annotation tool for machine learning practitioners.
gucorpling/amalgum
English web corpus with 4M tokens and several annotation types
andreabellandi/LexO-backend
dice-group/LIdioms
A multilingual linked idioms data set.
elexis-eu/tei2ontolex
TEI to OntoLex Conversion
elexis-eu/ontolex2tei
nmfisher/text2
Text labelling/annotation tool (Angular/NodeJS)
anasfkhan81/lemonEty
An etymological extension of the ontolex-lemon model along with two example files
francescapoli98/bachelor-thesis-project
Management of lexicographic resources with the aim of building a sense inventory. Collab with the ELEXIS project through ILC-CNR for my Bachelor Thesis in Digital Humanities. The Sense Inventory has been published as a resource in the Home Repository of ILC4CLARIN.
irenepisani/WSD-system
Knowledge-based baseline for Word Sense Disambiguation
Alir3z4/stop-words
List of common stop words in various languages.
kavgan/nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
marcoguerini/paired_datasets_for_persuasion
hazemalsaied/ATILF-LLF.v2
DARIAH-ERIC/lexicalresources
Data space of the DARIAH Lexical Resources Working Group
francescafrontini/MWExtractor
opener-project/coreference-base
Co-reference resolution for the English language.