Pinned Repositories
awesome-human-label-variation
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation (EMNLP 2022)
CrossRE
CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)
dialect-BLI
Dialect-BLI project repo
escoxlmr
Repository for the paper ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain (ACL2023)
germanic-lrl-corpora
A survey of corpora for Germanic low-resource languages and dialects
How-to-distill-your-BERT
Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)
logme-nlp
Code for Evidence > Intuition: Transferability Estimation for Encoder Selection (EMNLP 2022)
mainlp.github.io
MaiNLP research lab
semantic_components
Finding semantic components in your neural representations.
spectral-probing
Spectral Probing (EMNLP 2022)
MaiNLP's Repositories
mainlp/awesome-human-label-variation
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation (EMNLP 2022)
mainlp/CrossRE
CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)
mainlp/germanic-lrl-corpora
A survey of corpora for Germanic low-resource languages and dialects
mainlp/semantic_components
Finding semantic components in your neural representations.
mainlp/mainlp.github.io
MaiNLP research lab
mainlp/inferential-strategies
In this project, we evaluate inferential strategies employed by large language models in propositional logic problems and compare them to strategies observed in humans.
mainlp/MCQ-Mismatch
mainlp/MCQ-Robustness
mainlp/xsid
mainlp/BarNER
mainlp/maibaam-code
Code for preprocessing data for UD annotations and for tagging/parsing experiments of MaiBaam
mainlp/MJD-Estimator
Implementation of the EMNLP 2024 paper from the MaiNLP Lab - "Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
mainlp/NaLiBaSID
Repository with data and code for "Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants"
mainlp/NER-disagreements
mainlp/nnose
Codebase for NNOSE: Nearest Neighbor Occupational Skill Extraction
mainlp/subspace-chronicles
How Linguistic Information Emerges, Shifts and Interacts during Language Model Training (EMNLP 2023)
mainlp/TruthQuest
We introduce TruthQuest, a benchmark designed to evaluate the suppositional reasoning capabilities of large language models through knights and knaves puzzles.
mainlp/VariErr-NLI
mainlp/ClimatELi
CLIMATELI: Evaluating Entity Linking on Climate Change Data
mainlp/dialect-ToD-robustness
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties (EACL 2024)
mainlp/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
mainlp/donkii
mainlp/Eevee
An Easy Annotation Tool for Natural Language Processing
mainlp/el_esco
Codebase for Entity Linking in the Job Market Domain
mainlp/JointDeBERTa
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
mainlp/JUDGE-BENCH
mainlp/label-variation-nli
Code used in More Labels or Cases? Assessing Label Variation in Natural Language Inference.
mainlp/RC-analysis
Code for "What’s wrong with your model? A Quantitative Analysis of Relation Classification"
mainlp/SurvAI
mainlp/tot-eval