DFKI-NLP

Speech and Language Technology (SLT) Group of the Berlin lab of the German Research Center for Artificial Intelligence (DFKI)

Berlin, Germany

Pinned Repositories

DISTRE
[ACL 19] Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Language:Python85 6 913
fewie
Few-shot named entity recognition
Language:Python11 3 31
lrv
Layerwise Relevance Visualization in Convolutional Text Graph Classifiers
Language:Python12 4 02
MobIE
[Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fine-grained geo-entities, such as streets, stops and routes, as well as standard named entity types (organization, date, number, etc).
Language:Python11 2 00
product-corpus
This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the relation CompanyProvidesProduct.
12 2 02
RelEx
RelEx - A simple framework for Relation Extraction built on AllenNLP
Language:Jsonnet16 5 31
REval
[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction
Language:Python13 4 04
tacrev
[ACL 20] TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task
Language:Jupyter Notebook69 5 48
thermostat
Collection of NLP model explanations and accompanying analysis tools
Language:Jsonnet143 5 138
TRE
[AKBC 19] Improving Relation Extraction by Pre-trained Language Representations
Language:Python108 8 812

DFKI-NLP's Repositories

DFKI-NLP/DISTRE
[ACL 19] Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Language:Python85 6 913
DFKI-NLP/product-corpus
This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the relation CompanyProvidesProduct.
12 2 02
DFKI-NLP/MobIE
[Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fine-grained geo-entities, such as streets, stops and routes, as well as standard named entity types (organization, date, number, etc).
Language:Python11 2 00
DFKI-NLP/LLMCheckup
Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-explanation" (Wang et al. 2024)
Language:Python10 5 181
DFKI-NLP/MultiTACRED
[ACL23] This repository contains the code for our paper "MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset"
Language:Python8 3 1
DFKI-NLP/smartdata-corpus
A dataset of almost 2600 German-language documents which has been annotated with fine-grained geo-entities, standard named entity types, and a set of 15 traffic- and industry-related relations.
8 2 00
DFKI-NLP/SMV
Code and data for the ACL 2023 NLReasoning Workshop paper "Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods" (Feldhus et al., 2023)
Language:Python8 2 161
DFKI-NLP/defx
[SemEval 2020] Defx at SemEval-2020 Task 6: Joint Extraction of Concepts and Relations for Definition Extraction
Language:Jupyter Notebook7 3 11
DFKI-NLP/diamat
Machine Translation Diagnostics Tool
Language:Python6 4 14
DFKI-NLP/InterroLang
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations [EMNLP 2023 Findings]
Language:Python5 3 521
DFKI-NLP/pegasus-bridle
Ease your training experience on the DFKI GPU cluster :unicorn:
Language:Shell5 1 6
DFKI-NLP/CoXQL
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems
Language:Python2 2 00
DFKI-NLP/dfki-nlp.github.io
https://dfki-nlp.github.io
Language:TeX2 6 20
DFKI-NLP/sim3s-corpus
Corpus MobASA: a German-language corpus of tweets annotated with their relevance for public transportation, and with sentiment towards aspects related to barrier-free travel.
Language:Python2 1 00
DFKI-NLP/ADE_templates
This project contains templates and evaluation of models with these templates for the task of Adverse Drug Effect (ADE) detection.
Language:Jupyter Notebook1
DFKI-NLP/CockrACE-corpus
The “CockrACE” corpus consists of 140 news articles annotated with mentions of entities and their coreference links, as well as relation mentions for the evaluation of relation extraction (RE) experiments. Three semantic relations have been annotated, each of them dealing with people's family relationships (marriages, brother/sister, parent/child).
Language:XML1
DFKI-NLP/for-classifier
📚 Code for my master's thesis "Investigating Knowledge Injection Approaches for Research Field Classification of Scholarly Articles".
Language:Python10
DFKI-NLP/keepha_annotation_guidelines
Language:TeX1 0 0
DFKI-NLP/nfdi4ds-forc
Repository for constructing a dataset for the Field of Research Classification task.
Language:Jupyter Notebook1 1 02
DFKI-NLP/pynegex
PyNegEx pypi modular package for negex
Language:Python1 1 00
DFKI-NLP/semisupervised-mt-qe
Scripts and data to reproduce the experiments of the paper Bhatia et. al 2023, " Semi-supervised learning for Quality Estimation of Machine Translation" - MT Summit 2023
Language:Jupyter Notebook1 3 0
DFKI-NLP/Taxonomy4CL
Taxonomy for Computational Linguistics topics
Language:Python11
DFKI-NLP/tohyve-services
Language:Python1 4 41
DFKI-NLP/ACL2024-SymmetricAttentionBert
Language:Jupyter Notebook0 0
DFKI-NLP/bias-memit
Mass-Editing Stereotypical Associations to Mitigate Bias in Language Models
Language:Jupyter Notebook4 0
DFKI-NLP/celebrity-corpus
The “Celebrity” corpus consists of 150 news articles annotated with three semantic relations of the biographic domain. The corpus is provided in two formats, a CoNLL-like format (plain-text files with tabular-separated values) and an XML-based format. Files in the XML-based format can be loaded with https://github.com/DFKI-NLP/recon.
Language:XML1
DFKI-NLP/faq-rewrites-llms
This repository contains the dataset of FAQ texts rewritten using large language models described in our INLG 2024 paper "Enhancing Editorial Tasks: A Case Study on Rewriting Customer Help Page Contents Using Large Language Models".
DFKI-NLP/perseus-textgen
A repository for scripts to run awesomely large language models with text generation inference APIs and (chat) UIs
Language:Shell
DFKI-NLP/radiant
Web RAdio and auDIo processing system for Acessibility aNd Traceability
DFKI-NLP/subjective_text_complexity_corpus
A corpus consisting of German sentences, annotated with subjective complexity ratings by two target groups
0 0