PARL: A Dialog System Framework with Prompts as Actions for Reinforcement Learning

Tao Xiang, Yangzhe Li, Monika Wintergerst, Ana Pecini, Dominika Młynarczyk, Georg Groh

Research output: Contribution to journalConference articlepeer-review

Abstract

The performance of most current open-domain dialog systems is limited by the (training) dialog corpora due to either generation-based or retrieval-based learning patterns. To circumvent this limitation, we propose PARL, an open-domain dialog system framework using Prompts as Actions for Reinforcement Learning. This framework requires a (fixed) open-domain dialog system as the backbone and trains a behavior policy using reinforcement learning to guide the backbone system to respond appropriately with respect to a given conversation. The action space is defined as a finite set of behaviors in the form of natural language prompts. Preliminary results show that with the guidance of the behavior policy, the backbone system could generate more engaging and empathetic responses.

Original languageEnglish
Pages (from-to)633-640
Number of pages8
JournalInternational Conference on Agents and Artificial Intelligence
Volume3
DOIs
StatePublished - 2023
Event15th International Conference on Agents and Artificial Intelligence, ICAART 2023 - Lisbon, Portugal
Duration: 22 Feb 202324 Feb 2023

Keywords

  • Conversational AI
  • Open-Domain Dialog Systems
  • Prompting
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'PARL: A Dialog System Framework with Prompts as Actions for Reinforcement Learning'. Together they form a unique fingerprint.

Cite this