Logic2Text: High-Fidelity Natural Language Generation from Logical Forms

04/30/2020
by   Zhiyu Chen, et al.

Previous work on Natural Language Generation (NLG) from structured data has primarily focused on surface-level descriptions of record sequences. However, for complex structured data, e.g., multi-row tables, it is often desirable for an NLG system to describe interesting facts derived from logical inferences across records. If provided only with the table, existing models struggle to produce controllable and high-fidelity logical generations. In this work, we formulate logical-level NLG as generation from logical forms in order to obtain controllable, high-fidelity, and faithful generations. We present a new large-scale dataset, Logic2Text, with 10,753 descriptions involving common logic types, each paired with its underlying logical form. The logical forms show diversified graph structures with a free schema, which poses great challenges to a model's ability to understand their semantics. We experiment with (1) fully-supervised training on the full dataset, and (2) a few-shot setting, provided with hundreds of paired examples. We compare several popular generation models and analyze their performance. We hope our dataset can encourage research towards building an advanced NLG system capable of natural, faithful, and human-like generation. The dataset and code are available at <https://github.com/czyssrs/Logic2Text>.
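
To make the idea of a logical form over a table concrete, here is a minimal sketch in Python. The table, the operator names (`all_rows`, `argmax`, `hop`, `eq`, `count`, `filter_greater`), and the nested-tuple encoding are all assumptions chosen for illustration; they are not taken from the Logic2Text release and may differ from its actual schema. Each logical form is a small tree whose execution against the table yields a truth value, which is what lets a description generated from it be checked for fidelity.

```python
# Hypothetical toy table; not drawn from the Logic2Text dataset.
table = [
    {"player": "A", "points": 30},
    {"player": "B", "points": 25},
    {"player": "C", "points": 12},
]


def evaluate(node, rows):
    """Recursively evaluate a nested (operator, *args) logical form."""
    if not isinstance(node, tuple):
        return node  # literal: a column name, a string, or a number
    op, *args = node
    vals = [evaluate(arg, rows) for arg in args]
    if op == "all_rows":          # the whole table
        return rows
    if op == "argmax":            # row with the largest value in a column
        view, col = vals
        return max(view, key=lambda r: r[col])
    if op == "hop":               # read one column of a row
        row, col = vals
        return row[col]
    if op == "filter_greater":    # rows whose column value exceeds a threshold
        view, col, threshold = vals
        return [r for r in view if r[col] > threshold]
    if op == "count":             # number of rows in a view
        return len(vals[0])
    if op == "eq":                # equality check at the root
        return vals[0] == vals[1]
    raise ValueError(f"unknown operator: {op}")


# "Player A scored the most points." (superlative-style logic)
superlative = ("eq", ("hop", ("argmax", ("all_rows",), "points"), "player"), "A")

# "Two players scored more than 20 points." (count-style logic)
counting = ("eq", ("count", ("filter_greater", ("all_rows",), "points", 20)), 2)

print(evaluate(superlative, table))
print(evaluate(counting, table))
```

Both example forms evaluate to true on this toy table. A surface-level description would only restate individual cells, whereas these forms express inferences that span several records.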

research
04/22/2020

Logical Natural Language Generation from Open-Domain Tables

Neural natural language generation (NLG) models have recently shown rema...
research
05/25/2022

PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation

Logical table-to-text generation is a task that involves generating logi...
research
12/12/2021

Improving Logical-Level Natural Language Generation with Topic-Conditioned Data Augmentation and Logical Form Generation

Logical Natural Language Generation, i.e., generating textual descriptio...
research
12/02/2021

LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Natural language generation from structured data mainly focuses on surfa...
research
10/16/2022

Investigating the Robustness of Natural Language Generation from Logical Forms via Counterfactual Samples

The aim of Logic2Text is to generate controllable and faithful texts con...
research
07/24/2016

Redundancy-free Verbalization of Individuals for Ontology Validation

We investigate the problem of verbalizing Web Ontology Language (OWL) ax...
research
04/20/2018

Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

In this work, we focus on the task of generating natural language descri...