Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation

09/12/2021
by   Leyang Cui, et al.
0

Although pre-training models have achieved great success in dialogue generation, their performance drops dramatically when the input contains an entity that does not appear in pre-training and fine-tuning datasets (unseen entity). To address this issue, existing methods leverage an external knowledge base to generate appropriate responses. In real-world scenario, the entity may not be included by the knowledge base or suffer from the precision of knowledge retrieval. To deal with this problem, instead of introducing knowledge base as the input, we force the model to learn a better semantic representation by predicting the information in the knowledge base, only based on the input context. Specifically, with the help of a knowledge base, we introduce two auxiliary training objectives: 1) Interpret Masked Word, which conjectures the meaning of the masked entity given the context; 2) Hypernym Generation, which predicts the hypernym of the entity based on the context. Experiment results on two dialogue corpus verify the effectiveness of our methods under both knowledge available and unavailable settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2022

Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning

Entities lie in the heart of biomedical natural language understanding, ...
research
11/15/2021

Calculating Question Similarity is Enough:A New Method for KBQA Tasks

Knowledge Base Question Answering (KBQA) aims to answer natural language...
research
02/08/2021

Partial Is Better Than All: Revisiting Fine-tuning Strategy for Few-shot Learning

The goal of few-shot learning is to learn a classifier that can recogniz...
research
07/08/2022

DSTEA: Dialogue State Tracking with Entity Adaptive Pre-training

Dialogue state tracking (DST) is a core sub-module of a dialogue system,...
research
01/30/2023

GE-Blender: Graph-Based Knowledge Enhancement for Blender

Although the great success of open-domain dialogue generation, unseen en...
research
12/04/2019

AMUSED: A Multi-Stream Vector Representation Method for Use in Natural Dialogue

The problem of building a coherent and non-monotonous conversational age...
research
03/12/2018

Entity-Aware Language Model as an Unsupervised Reranker

In language modeling, it is difficult to incorporate entity relationship...

Please sign up or login with your details

Forgot password? Click here to reset