DFKI-NLP
Speech and Language Technology (SLT) Group of the Berlin lab of the German Research Center for Artificial Intelligence (DFKI)
Berlin, Germany
Pinned Repositories
DISTRE
[ACL 19] Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
fewie
Few-shot named entity recognition
lrv
Layerwise Relevance Visualization in Convolutional Text Graph Classifiers
MobIE
[Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fine-grained geo-entities, such as streets, stops and routes, as well as standard named entity types (organization, date, number, etc).
product-corpus
This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the relation CompanyProvidesProduct.
RelEx
RelEx - A simple framework for Relation Extraction built on AllenNLP
REval
[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction
tacrev
[ACL 20] TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task
thermostat
Collection of NLP model explanations and accompanying analysis tools
TRE
[AKBC 19] Improving Relation Extraction by Pre-trained Language Representations
DFKI-NLP's Repositories
DFKI-NLP/DISTRE
[ACL 19] Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
DFKI-NLP/product-corpus
This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the relation CompanyProvidesProduct.
DFKI-NLP/MobIE
[Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fine-grained geo-entities, such as streets, stops and routes, as well as standard named entity types (organization, date, number, etc).
DFKI-NLP/LLMCheckup
Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-explanation" (Wang et al. 2024)
DFKI-NLP/MultiTACRED
[ACL23] This repository contains the code for our paper "MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset"
DFKI-NLP/smartdata-corpus
A dataset of almost 2600 German-language documents which has been annotated with fine-grained geo-entities, standard named entity types, and a set of 15 traffic- and industry-related relations.
DFKI-NLP/SMV
Code and data for the ACL 2023 NLReasoning Workshop paper "Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods" (Feldhus et al., 2023)
DFKI-NLP/defx
[SemEval 2020] Defx at SemEval-2020 Task 6: Joint Extraction of Concepts and Relations for Definition Extraction
DFKI-NLP/diamat
Machine Translation Diagnostics Tool
DFKI-NLP/InterroLang
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations [EMNLP 2023 Findings]
DFKI-NLP/pegasus-bridle
Ease your training experience on the DFKI GPU cluster :unicorn:
DFKI-NLP/CoXQL
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems
DFKI-NLP/dfki-nlp.github.io
https://dfki-nlp.github.io
DFKI-NLP/sim3s-corpus
Corpus MobASA: a German-language corpus of tweets annotated with their relevance for public transportation, and with sentiment towards aspects related to barrier-free travel.
DFKI-NLP/ADE_templates
This project contains templates and evaluation of models with these templates for the task of Adverse Drug Effect (ADE) detection.
DFKI-NLP/CockrACE-corpus
The “CockrACE” corpus consists of 140 news articles annotated with mentions of entities and their coreference links, as well as relation mentions for the evaluation of relation extraction (RE) experiments. Three semantic relations have been annotated, each of them dealing with people's family relationships (marriages, brother/sister, parent/child).
DFKI-NLP/for-classifier
📚 Code for my master's thesis "Investigating Knowledge Injection Approaches for Research Field Classification of Scholarly Articles".
DFKI-NLP/keepha_annotation_guidelines
DFKI-NLP/nfdi4ds-forc
Repository for constructing a dataset for the Field of Research Classification task.
DFKI-NLP/pynegex
PyNegEx pypi modular package for negex
DFKI-NLP/semisupervised-mt-qe
Scripts and data to reproduce the experiments of the paper Bhatia et. al 2023, " Semi-supervised learning for Quality Estimation of Machine Translation" - MT Summit 2023
DFKI-NLP/Taxonomy4CL
Taxonomy for Computational Linguistics topics
DFKI-NLP/tohyve-services
DFKI-NLP/ACL2024-SymmetricAttentionBert
DFKI-NLP/bias-memit
Mass-Editing Stereotypical Associations to Mitigate Bias in Language Models
DFKI-NLP/celebrity-corpus
The “Celebrity” corpus consists of 150 news articles annotated with three semantic relations of the biographic domain. The corpus is provided in two formats, a CoNLL-like format (plain-text files with tabular-separated values) and an XML-based format. Files in the XML-based format can be loaded with https://github.com/DFKI-NLP/recon.
DFKI-NLP/faq-rewrites-llms
This repository contains the dataset of FAQ texts rewritten using large language models described in our INLG 2024 paper "Enhancing Editorial Tasks: A Case Study on Rewriting Customer Help Page Contents Using Large Language Models".
DFKI-NLP/perseus-textgen
A repository for scripts to run awesomely large language models with text generation inference APIs and (chat) UIs
DFKI-NLP/radiant
Web RAdio and auDIo processing system for Acessibility aNd Traceability
DFKI-NLP/subjective_text_complexity_corpus
A corpus consisting of German sentences, annotated with subjective complexity ratings by two target groups