Pinned Repositories
ba-text-mining
Hands-on material for the course text-mining BA, taught at VU Amsterdam
EventCoreference
Compares descriptions of events within and across documents to decide if they refer to the same events.
EventStoryLine
Materials for the StoryLine extraction task - annotated data, baselines and evaluation scripts, evaluation data.
KafNafParserPy
Parser for KAF NAF files written in Python
OpenDutchWordnet
This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.
pepper
VU-CLTL Pepper/Nao Application Repository (Python 2)
python-for-text-analysis
If you want to use Python for text analysis, this course is for you!
SpaCy-to-NAF
spaCy-to-naf converter
ThesisTips
A collection of tips for writing a PhD thesis
wsd-dynamic-sense-vector
Computational Linguistics and & Text Mining Lab's Repositories
cltl/python-for-text-analysis
If you want to use Python for text analysis, this course is for you!
cltl/ba-text-mining
Hands-on material for the course text-mining BA, taught at VU Amsterdam
cltl/SpaCy-to-NAF
spaCy-to-naf converter
cltl/ma-hlt-labs
Human Language Technology Notebooks for Lab sessions, Master Students
cltl/ma-ml4nlp-labs
Course code for "Machine Learning in NLP"
cltl/entity-identification-from-scratch
Entity recognition and linking for historical documents in Dutch, developed within the Clariah+ project at VU Amsterdam
cltl/event-resource-interoperability
cltl/aproof-icf-classifier
Classifier that can read medical reports and assign a functional level classification following the WHO ICF classification scheme.
cltl/cltl-ma-thesis
(LaTeX) MA thesis template
cltl/ma-communicative-robots
Communication robots
cltl/reference-framing-perspective
Workshop website
cltl/FrameNetNLTK
cltl/cltl.github.io
CLTL organization site
cltl/rfp_corpus_collection
Collect a referentially grounded corpus for the 1st workshop on Reference, Framing, and Perspective (LREC-COLING 2024)
cltl/bibliography
CLTL bibtex bibliography
cltl/cltl-homepage
cltl/DominantFrameLabeler
cltl/dreamslab
cltl/event-classification-tool
cltl/grounding-toxicity
Code base for the paper Grounding Toxicity in Real-World Events across Languages
cltl/InappropriateLanguageDetection
This repository contains annotated data on inappropriate language in online discussions, generated through a combination of expert annotation, crowd-sourcing, and ChatGPT-based methods.
cltl/Lingoturk
Creating crowdsourcing based experiments made easy
cltl/panli
Perspective-Aware Natural Language Inference
cltl/panli-crowdtruth
A CrowdTruth analysis of the PANLI dataset
cltl/panli-models
Model evaluation on the PANLI dataset
cltl/Reddit_topic_toxicity
Toxicity analysis of Reddit conversation across topics and languages
cltl/span-annotation-tool
cltl/unkown_script
Code base for the paper Unknown Script: Impact of Script on Cross-Lingual Transfer
cltl/Wiktionary_Reader
cltl/word2vec_using_gensim