DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer

10/12/2021
by Haozhe Ji, et al.

Despite recent advances in applying pre-trained language models to generate high-quality texts, generating long passages that maintain long-range coherence remains challenging for these models. In this paper, we propose DiscoDVT, a discourse-aware discrete variational Transformer, to tackle the incoherence issue. DiscoDVT learns a discrete variable sequence that summarizes the global structure of the text and then applies it to guide the generation process at each decoding step. To further embed discourse-aware information into the discrete latent representations, we introduce an auxiliary objective to model the discourse relations within the text. We conduct extensive experiments on two open story generation datasets and demonstrate that the latent codes learn a meaningful correspondence to the discourse structures that guide the model to generate long texts with better long-range coherence.

research
05/19/2021

Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence

Generating long and coherent text is an important but challenging task, ...
research
12/27/2016

The ontogeny of discourse structure mimics the development of literature

Discourse varies with age, education, psychiatric state and historical e...
research
03/21/2022

Language modeling via stochastic processes

Modern language models can generate high-quality short texts. However, t...
research
09/05/2021

Transformer Models for Text Coherence Assessment

Coherence is an important aspect of text quality and is crucial for ensu...
research
05/10/2018

Discourse-Aware Neural Rewards for Coherent Text Generation

In this paper, we investigate the use of discourse-aware rewards with re...
research
04/13/2023

LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model

Universally modeling all typical information extraction tasks (UIE) with...
research
04/22/2022

ChapterBreak: A Challenge Dataset for Long-Range Language Models

While numerous architectures for long-range language models (LRLMs) have...
