Slot and Intent Detection for Low-Resource Language Varieties (SID4LR): Dialect Extension

This repository contains the dialect extension to the xSID corpus by van der Goot et al. (NAACL 2021). The extension adds Neapolitan and Swiss German (Bernese) dialect translations. Using the extended corpus, we created the shared task Slot and Intent Detection for Low-Resource Language Varieties (SID4LR), which we organized as part of the workshop on Natural Language Processing for Similar Languages, Varieties and Dialects (VarDial), co-located with EACL 2023.

The shared task is available here: https://bitbucket.org/robvanderg/sid4lr/src/master/

If you refer to the SID4LR shared task or use v4.0 of the xSID corpus, please cite the paper "Findings of the VarDial Evaluation Campaign 2023":

@inproceedings{aepli-etal-2023-findings,
    title = "Findings of the {V}ar{D}ial Evaluation Campaign 2023",
    author = {Aepli, No{\"e}mi  and
      {\c{C}}{\"o}ltekin, {\c{C}}a{\u{g}}r{\i}  and
      van der Goot, Rob  and
      Jauhiainen, Tommi  and
      Kazzaz, Mourhaf  and
      Ljube{\v{s}}i{\'c}, Nikola  and
      North, Kai  and
      Plank, Barbara  and
      Scherrer, Yves  and
      Zampieri, Marcos},
    booktitle = "Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)",
    month = may,
    year = "2023",
    address = "Dubrovnik, Croatia",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.vardial-1.25",
    pages = "251--261",
    abstract = "This report presents the results of the shared tasks organized as part of the VarDial Evaluation Campaign 2023. The campaign is part of the tenth workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with EACL 2023. Three separate shared tasks were included this year: Slot and intent detection for low-resource language varieties (SID4LR), Discriminating Between Similar Languages {--} True Labels (DSL-TL), and Discriminating Between Similar Languages {--} Speech (DSL-S). All three tasks were organized for the first time this year.",
}