XPersona: Evaluating Multilingual Personalized Chatbot

License: MIT

This is the source code of the paper:

XPersona: Evaluating Multilingual Personalized Chatbot. [PDF]

This code is written in PyTorch. If you use the source code or datasets included in this toolkit in your work, please cite the following papers:

XPersona

@article{lin2020xpersona,
  title={XPersona: Evaluating Multilingual Personalized Chatbot},
  author={Lin, Zhaojiang and Liu, Zihan and Winata, Genta Indra and Cahyawijaya, Samuel and Madotto, Andrea and Bang, Yejin and Ishii, Etsuko and Fung, Pascale},
  journal={arXiv preprint arXiv:2003.07568},
  year={2020}
}

English PersonaChat

@article{zhang2018personalizing,
  title={Personalizing Dialogue Agents: I have a dog, do you have pets too?},
  author={Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason},
  journal={arXiv preprint arXiv:1801.07243},
  year={2018}
}

Dataset

The XPersona dataset is an extension of the Persona-Chat dataset. Specifically, we extend ConvAI2 to six other languages: Chinese, French, Indonesian, Italian, Korean, and Japanese.

Baselines

In this work, we provide baselines trained in both multilingual and cross-lingual settings. See the multilingual and crosslingual folders for more details.

Acknowledgement

This repository is built on the Hugging Face codebase.