Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue

by   Dongyeop Kang, et al.

Traditional recommendation systems produce static rather than interactive recommendations invariant to a user's specific requests, clarifications, or current mood, and can suffer from the cold-start problem if their tastes are unknown. These issues can be alleviated by treating recommendation as an interactive dialogue task instead, where an expert recommender can sequentially ask about someone's preferences, react to their requests, and recommend more appropriate items. In this work, we collect a goal-driven recommendation dialogue dataset (GoRecDial), which consists of 9,125 dialogue games and 81,260 conversation turns between pairs of human workers recommending movies to each other. The task is specifically designed as a cooperative game between two players working towards a quantifiable common goal. We leverage the dataset to develop an end-to-end dialogue system that can simultaneously converse and recommend. Models are first trained to imitate the behavior of human players without considering the task goal itself (supervised training). We then finetune our models on simulated bot-bot conversations between two paired pre-trained models (bot-play), in order to achieve the dialogue goal. Our experiments show that models finetuned with bot-play learn improved dialogue strategies, reach the dialogue goal more often when paired with a human, and are rated as more consistent by humans compared to models trained without bot-play. The dataset and code are publicly available through the ParlAI framework.


page 3

page 13

page 14


IRF: Interactive Recommendation through Dialogue

Recent research focuses beyond recommendation accuracy, towards human fa...

DialogZoo: Large-Scale Dialog-Oriented Task Learning

Building unified conversational agents has been a long-standing goal of ...

What Should I Ask? Using Conversationally Informative Rewards for Goal-Oriented Visual Dialog

The ability to engage in goal-oriented conversations has allowed humans ...

Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

Recently, the development of large language models (LLMs) has been signi...

Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning in Goal-Oriented Dialogue Models

Existing dialogue models may encounter scenarios which are not well-repr...

Instruction Clarification Requests in Multimodal Collaborative Dialogue Games: Tasks, and an Analysis of the CoDraw Dataset

In visual instruction-following dialogue games, players can engage in re...

Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue

Goal-directed dialogue systems aim to proactively reach a pre-determined...

Please sign up or login with your details

Forgot password? Click here to reset