Pinned Repositories
ContraDecode
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding"
ContraPro
Contrastive evaluation of pronoun translation in neural machine translation
ContraWSD
Word sense disambiguation test sets for NMT
coverage-contrastive-conditioning
Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning" (ACL 2022)
mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
multilingual-instruction-tuning
Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
nmtscore
A library of translation-based text similarity measures
swissbert
The multilingual language model for Switzerland
understanding-mbr
xstance
A Multilingual Multi-Target Dataset for Stance Detection
ZurichNLP's Repositories
ZurichNLP/mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
ZurichNLP/xstance
A Multilingual Multi-Target Dataset for Stance Detection
ZurichNLP/ContraDecode
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding"
ZurichNLP/nmtscore
A library of translation-based text similarity measures
ZurichNLP/swissbert
The multilingual language model for Switzerland
ZurichNLP/multilingual-instruction-tuning
Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
ZurichNLP/coverage-contrastive-conditioning
Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning" (ACL 2022)
ZurichNLP/understanding-mbr
ZurichNLP/segtest
A Test Suite for Morphological Phenomena in Neural Machine Translation
ZurichNLP/BLESS
Code for the EMNLP 2023 paper "BLESS: Benchmarking Large Language Models on Sentence Simplification"
ZurichNLP/mbr-sensitivity
Data and code for the paper "Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET"
ZurichNLP/sdg_swisstext_2024_sharedtask
Repository for data and evaluation of 2024 Shared Task on SDG classification held by the Swiss Text Conference.
ZurichNLP/translation-direction-detection
Unsupervised translation direction detection using NMT systems
ZurichNLP/20Minuten
ZurichNLP/acl2020-historical-text-normalization
Code for the ACL 2020 paper "Semi-supervised Contextual Historical Text Normalization" by Peter Makarov and Simon Clematide
ZurichNLP/contrastive-conditioning
Code and data accompanying the paper "Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias"
ZurichNLP/distil-lingeval
Data and code accompanying the paper "On the Limits of Minimal Pairs in Contrastive Evaluation"
ZurichNLP/MultiPivotNMT
The implementation of "Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models"
ZurichNLP/recognizing-semantic-differences
Code for the paper "Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents"
ZurichNLP/specific_hospo_respo
Code for hospitality review response generation
ZurichNLP/swiss-german-text-encoders
Code for the paper "Modular Adaptation of Multilingual Encoders to Written Swiss German Dialect"
ZurichNLP/voting-booklet-bias
Code for the paper "Voting Booklet Bias: Stance Detection in Swiss Federal Communication"
ZurichNLP/romanisation-transfer
Code for the Paper "On Romanization for Model Transfer Between Scripts in Neural Machine Translation"
ZurichNLP/understanding-ctx-aug
Code for the 2023 ACL Findings paper, Uncovering Hidden Consequences of Pre-training Objectives in Sequence-to-Sequence Models (Kew & Sennrich, 2023)
ZurichNLP/llm-response-stability
Data and code for the paper "Yes, no, maybe? Revisiting language models' response stability under paraphrasing for the assessment of political leaning"
ZurichNLP/SimpleFUDGE
Code for the paper "Target-Level Sentence Simplification as Controlled Paraphrasing" (TSAR 2022)
ZurichNLP/simplewiki-data-acquisition
ZurichNLP/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
ZurichNLP/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
ZurichNLP/window_audio_segmentation
Code and data for the paper "Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation"