Awesome papers for role-playing with language models

Awesome-Role Playing-Papers

License: MIT

A list of papers, datasets, and code repositories for role-playing with language models. This repo covers most research in the role-playing field, but it may not include every noteworthy work.

🚀🚀🚀 Our thoughts and review of role-playing are in this survey: The Oscars of AI Theater: A Survey on Role-Playing with Language Models.

If you find that we have missed related work or described a paper incorrectly, please contact us or open an issue!

This repo is a work in progress. Please feel free to open issues and make PRs!

Keywords Convention

The key contribution in the work.

The work is related to Large language models.

1. Data

1.1 Character-based Role-Playing Datasets:

  • Characteristic AI Agents via Large Language Models [COLING2024] [Code]

    Xi Wang, Hongliang Dai, Shen Gao, Piji Li

  • TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models [ACL2024] [Code]

    Jaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim

  • Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works [Arxiv] [Code]

    Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang

  • Character is Destiny: Can Large Language Models Simulate Persona-Driven Decisions in Role-Playing? [Arxiv]

    Rui Xu, Xintao Wang, Jiangjie Chen, Siyu Yuan, Xinfeng Yuan, Jiaqing Liang, Zulong Chen, Xiaoqing Dong, Yanghua Xiao

  • PIPPA: A Partially Synthetic Conversational Dataset [Arxiv] [Code]

    Tear Gosling, Alpin Dale, Yinhe Zheng

  • RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models [Arxiv] [Code]

    Zekun Moore Wang, Zhongyuan Peng, Haoran Que, Jiaheng Liu, Wangchunshu Zhou, Yuhan Wu, Hongcheng Guo, Ruitong Gan, Zehao Ni, Jian Yang, Man Zhang, Zhaoxiang Zhang, Wanli Ouyang, Ke Xu, Stephen W. Huang, Jie Fu, Junran Peng

  • Character-LLM: A Trainable Agent for Role-Playing [EMNLP2023] [Code]

    Yunfan Shao, Linyang Li, Junqi Dai, Xipeng Qiu

  • RoleInteract: Evaluating the Social Interaction of Role-Playing Agents [Arxiv] [Code]

    Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Xing Gao, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang, Jingren Zhou

  • Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment [Arxiv] [Code]

    Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou

  • CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation [Arxiv] [Code]

    Quan Tu, Shilong Fan, Zihang Tian, Rui Yan

  • ChatHaruhi: Reviving Anime Character in Reality via Large Language Model [Arxiv] [Code]

    Cheng Li, Ziang Leng, Chenxi Yan, Junyi Shen, Hao Wang, Weishi Mi, Yaying Fei, Xiaoyang Feng, Song Yan, HaoSheng Wang, Linkang Zhan, Yaokai Jia, Pingyu Wu, Haozhen Sun

  • PRODIGy: a PROfile-based DIalogue Generation dataset [Arxiv]

    Daniela Occhipinti, Serra Sinem Tekiroglu, Marco Guerini

  • CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models [Arxiv] [Code]

    Jinfeng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wenjing Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

  • ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human [Arxiv] [Code]

    Junfeng Tian, Hehong Chen, Guohai Xu, Ming Yan, Xing Gao, Jianhai Zhang, Chenliang Li, Jiayi Liu, Wenshen Xu, Haiyang Xu, Qi Qian, Wei Wang, Qinghao Ye, Jiejing Zhang, Ji Zhang, Fei Huang, Jingren Zhou

  • Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters [EMNLP2023] [Code]

    Nuo Chen, Yan Wang, Haiyun Jiang, Deng Cai, Yuhan Li, Ziyang Chen, Longyue Wang, Jia Li

  • Beyond Retrieval: Embracing Compressive Memory in Real-World Long-Term Conversations [Arxiv] [Code]

    Nuo Chen, Hongguang Li, Juhua Huang, Baoyuan Wang, Jia Li

  • CharacterChat: Learning towards Conversational AI with Personalized Social Support [Arxiv] [Code]

    Quan Tu, Chuanqi Chen, Jinpeng Li, Yanran Li, Shuo Shang, Dongyan Zhao, Ran Wang, Rui Yan

  • LaMP: When Large Language Models Meet Personalization [Arxiv]

    Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani

1.2 Persona-based Role-Playing Datasets:

  • LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming [ACL2023] [Code]

    Jingsheng Gao, Yixin Lian, Ziyi Zhou, Yuzhuo Fu, Baoyuan Wang

  • When Crowd Meets Persona: Creating a Large-Scale Open-Domain Persona Dialogue Corpus [Arxiv]

    Won Ik Cho, Yoon Kyung Lee, Seoyeon Bae, Jihwan Kim, Sangah Park, Moosung Kim, Sowon Hahn, Nam Soo Kim

  • RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors’ Own Personalities [PACLIC2023] [Code]

    Sanae Yamashita, Koji Inoue, Ao Guo, Shota Mochizuki, Tatsuya Kawahara, Ryuichiro Higashinaka

  • Multimodal Persona Based Generation of Comic Dialogs [ACL2023] [Code]

    Harsh Agrawal, Aditya Mishra, Manish Gupta, Mausam

  • Enhancing Chat Language Models by Scaling High-quality Instructional Conversations [EMNLP2023] [Code]

    Ning Ding, Yulin Chen, Bokai Xu, Yujia Qin, Zhi Zheng, Shengding Hu, Zhiyuan Liu, Maosong Sun, Bowen Zhou

  • MPCHAT: Towards Multimodal Persona-Grounded Conversation [ACL2023] [Code]

    Jaewoo Ahn, Yeda Song, Sangdoo Yun, Gunhee Kim

  • Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue [Arxiv]

    Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, Kam-Fai Wong

  • Long Time No See! Open-Domain Conversation with Long-Term Persona Memory [ACL2022] [Code]

    Xinchao Xu, Zhibin Gou, Wenquan Wu, Zheng-Yu Niu, Hua Wu, Haifeng Wang, Shihang Wang

  • Profile Consistency Identification for Open-domain Dialogue Agents [EMNLP2022]

    Haoyu Song, Yan Wang, Wei-Nan Zhang, Zhengyu Zhao, Ting Liu, Xiaojiang Liu

  • Pchatbot: A Large-Scale Dataset for Personalized Chatbot [SIGIR2021] [Code]

    Hongjin Qian, Xiaohe Li, Hanxun Zhong, Yu Guo, Yueyuan Ma, Yutao Zhu, Zhanliang Liu, Zhicheng Dou, Ji-Rong Wen

  • Personalized Dialogue Generation with Diversified Traits [Arxiv]

    Yinhe Zheng, Guanyi Chen, Minlie Huang, Song Liu, Xuan Zhu

  • Personalizing Dialogue Agents: I have a dog, do you have pets too? [ACL2018] [Code]

    Saizheng Zhang, Emily Dinan, Jack Urbanek, Arthur Szlam, Douwe Kiela, Jason Weston

  • Wizard of Wikipedia: Knowledge-Powered Conversational Agents [ICLR2019] [Code]

    Emily Dinan, Stephen Roller, Kurt Shuster, Angela Fan, Michael Auli, Jason Weston

  • FriendsQA: Open-Domain Question Answering on TV Show Transcripts [SIGDIAL2019]

    Zhengzhe Yang, Jinho D. Choi

  • Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge [Arxiv] [Code]

    Yoonna Jang, Jungwoo Lim, Yuna Hur, Dongsuk Oh, Suhyune Son, Yeonsoo Lee, Donghoon Shin, Seungryong Kim, Heuiseok Lim

2. Alignment

  • Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment [Arxiv] [Code]

    Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou

  • CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models [Arxiv] [Code]

    Jinfeng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wenjing Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

  • ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human [Arxiv] [Code]

    Junfeng Tian, Hehong Chen, Guohai Xu, Ming Yan, Xing Gao, Jianhai Zhang, Chenliang Li, Jiayi Liu, Wenshen Xu, Haiyang Xu, Qi Qian, Wei Wang, Qinghao Ye, Jiejing Zhang, Ji Zhang, Fei Huang, Jingren Zhou

  • Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning [EMNLP2023]

    Ryan Shea, Zhou Yu

  • P5: Plug-and-Play Persona Prompting for Personalized Response Selection [EMNLP2023] [Code]

    Joosung Lee, Minsik Oh, Donghun Lee

  • RECAP: Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation [ACL2023] [Code]

    Shuai Liu, Hyundong Cho, Marjorie Freedman, Xuezhe Ma, Jonathan May

  • MPCHAT: Towards Multimodal Persona-Grounded Conversation [ACL2023] [Code]

    Jaewoo Ahn, Yeda Song, Sangdoo Yun, Gunhee Kim

  • Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models [ACL2023]

    Myra Cheng, Esin Durmus, Dan Jurafsky

  • Learning to Predict Persona Information for Dialogue Personalization without Explicit Persona Description [ACL2023]

    Wangchunshu Zhou, Qifei Li, Chenle Li

  • Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization [ACL2023] [Code]

    Liang Chen, Hongru Wang, Yang Deng, Wai Chung Kwan, Zezhong Wang, Kam-Fai Wong

  • Towards Zero-Shot Persona Dialogue Generation with In-Context Learning [ACL2023]

    Xinchao Xu, Zeyang Lei, Wenquan Wu, Zheng-Yu Niu, Hua Wu, Haifeng Wang

  • SimOAP: Improve Coherence and Consistency in Persona-based Dialogue Generation via Over-sampling and Post-evaluation [ACL2023]

    Junkai Zhou, Liang Pang, Huawei Shen, Xueqi Cheng

  • Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense Persona [ACL2023]

    Yihong Tang, Bo Wang, Miao Fang, Dongming Zhao, Kun Huang, Ruifang He, Yuexian Hou

  • CharacterChat: Learning towards Conversational AI with Personalized Social Support [Arxiv] [Code]

    Quan Tu, Chuanqi Chen, Jinpeng Li, Yanran Li, Shuo Shang, Dongyan Zhao, Ran Wang, Rui Yan

  • Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue [Arxiv]

    Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, Kam-Fai Wong

  • PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer [Arxiv]

    Xu Han, Bin Guo, Yoon Jung, Benjamin Yao, Yu Zhang, Xiaohu Liu, Chenlei Guo

  • Aligning Language Models to User Opinions [Arxiv]

    EunJeong Hwang, Bodhisattwa Prasad Majumder, Niket Tandon

  • Toxicity in ChatGPT: Analyzing Persona-assigned Language Models [Arxiv]

    Ameet Deshpande, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan

  • Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues [AAAI2023] [Code]

    Ruijun Chen, Jin Wang, Liang-Chih Yu, Xuejie Zhang

  • Improving Personalized Explanation Generation through Visualization [ACL2022]

    Shijie Geng, Zuohui Fu, Yingqiang Ge, Lei Li, Gerard de Melo, Yongfeng Zhang

  • There Are a Thousand Hamlets in a Thousand People’s Eyes: Enhancing Knowledge-grounded Dialogue with Personal Memory [ACL2022]

    Tingchen Fu, Xueliang Zhao, Chongyang Tao, Ji-Rong Wen, Rui Yan

  • Long Time No See! Open-Domain Conversation with Long-Term Persona Memory [ACL2022] [Code]

    Xinchao Xu, Zhibin Gou, Wenquan Wu, Zheng-Yu Niu, Hua Wu, Haifeng Wang, Shihang Wang

  • A Model-agnostic Data Manipulation Method for Persona-based Dialogue Generation [ACL2022]

    Yu Cao, Wei Bi, Meng Fang, Shuming Shi, Dacheng Tao

  • PAIGE: Personalized Adaptive Interactions Graph Encoder for Query Rewriting in Dialogue Systems [EMNLP2022]

    Daniel Biś, Saurabh Gupta, Jie Hao, Xing Fan, Chenlei Guo

  • AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant [EMNLP2022]

    Weixian Lei, Difei Gao, Yuxuan Wang, Dongxing Mao, Zihan Liang, Lingmin Ran, Mike Zheng Shou

  • MCP: Self-supervised Pre-training for Personalized Chatbots with Multi-level Contrastive Sampling [EMNLP2022]

    Zhaoheng Huang, Zhicheng Dou, Yutao Zhu, Zhengyi Ma

  • Persona-Based Conversational AI: State of the Art and Challenges [ICDMW2022]

    Junfeng Liu, Christopher Symons, Ranga Raju Vatsavai

  • Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation [NAACL2022]

    Hanxun Zhong, Zhicheng Dou, Yutao Zhu, Hongjin Qian, Ji-Rong Wen

  • COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas [SIGIR2022] [Code]

    Chen Xu, Piji Li, Wei Wang, Haoran Yang, Siyun Wang, Chuangbai Xiao

  • BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data [ACL2021] [Code]

    Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang, Ting Liu

3. Role-Playing Language Model Agents (RPLAs)

  • Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration [NAACL2024] [Code]

    Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji

  • HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing [Arxiv]

    Jing Chen, Xinyu Zhu, Cheng Yang, Chufan Shi, Yadong Xi, Yuxiang Zhang, Junjie Wang, Jiashu Pu, Rongsheng Zhang, Yujiu Yang, Tian Feng

  • On the Decision-Making Abilities in Role-Playing using Large Language Models [Arxiv]

    Chenglei Shen, Guofu Xie, Xiao Zhang, Jun Xu

  • Generative Agents: Interactive Simulacra of Human Behavior [UIST2023] [Code]

    Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein

  • Humanoid Agents: Platform for Simulating Human-like Generative Agents [EMNLP2023] [Code]

    Zhilin Wang, Yu Ying Chiu, Yu Cheung Chiu

  • Language models show human-like content effects on reasoning tasks [PNAS Nexus]

    Ishita Dasgupta, Andrew K. Lampinen, Stephanie C. Y. Chan, Hannah R. Sheahan, Antonia Creswell, Dharshan Kumaran, James L. McClelland, Felix Hill