This is the source code of the paper:
XPersona: Evaluating Multilingual Personalized Chatbot. [PDF]
This code is written in PyTorch. If you use the source code or datasets included in this toolkit in your work, please cite the following papers:
XPersona
@article{lin2020xpersona,
  title={XPersona: Evaluating Multilingual Personalized Chatbot},
  author={Lin, Zhaojiang and Liu, Zihan and Winata, Genta Indra and Cahyawijaya, Samuel and Madotto, Andrea and Bang, Yejin and Ishii, Etsuko and Fung, Pascale},
  journal={arXiv preprint arXiv:2003.07568},
  year={2020}
}
English PersonaChat
@article{zhang2018personalizing,
  title={Personalizing Dialogue Agents: I have a dog, do you have pets too?},
  author={Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason},
  journal={arXiv preprint arXiv:1801.07243},
  year={2018}
}
The XPersona dataset is an extension of the Persona-Chat dataset. Specifically, we extend ConvAI2 to six other languages: Chinese, French, Indonesian, Italian, Korean, and Japanese.
In this work, we provide multilingual and crosslingual trained baselines. See the multilingual and crosslingual folders for more details.
This repository is built on the Hugging Face codebase.