STOAT: Structured Data to Analytical Text With Controls

05/19/2023
by Deepanway Ghosal, et al.

Recent language models have made tremendous progress on the structured data-to-text generation task. However, these models still perform sub-optimally when logical inference is required to generate the descriptions. In this work, we focus specifically on analytical text generation from structured data such as tables. Building on the taxonomy proposed by Gupta et al. (2020), we focus on controllable table-to-text generation for the following reasoning categories: numerical reasoning, commonsense reasoning, temporal reasoning, table knowledge, and entity knowledge. We propose the STOAT model, which is table- and reasoning-aware and uses vector quantization to infuse the given reasoning categories into the output. We observe that our model provides improvements of 10.19% and 1.13% on the analytical sentence task. We also found that our model generates 15.3% more faithful and analytical descriptions than the baseline models in human evaluation. We curate and release two reasoning-category-annotated table-to-interesting text generation datasets based on the ToTTo (Parikh et al., 2020) and InfoTabs (Gupta et al., 2020) datasets.

research
04/16/2021

Learning to Reason for Text Generation from Scientific Tables

In this paper, we introduce SciGen, a new challenge dataset for the task...
research
12/16/2022

MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation

Prompting large language models has enabled significant recent progress ...
research
07/06/2020

DART: Open-Domain Structured Data Record to Text Generation

We introduce DART, a large dataset for open-domain structured data recor...
research
09/23/2019

Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data

A number of researchers have recently questioned the necessity of increa...
research
06/03/2019

Handling Divergent Reference Texts when Evaluating Table-to-Text Generation

Automatically constructed datasets for generating text from semi-structu...
research
01/12/2020

Revisiting Challenges in Data-to-Text Generation with Fact Grounding

Data-to-text generation models face challenges in ensuring data fidelity...
research
10/12/2020

Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data

Neural text generation (data- or text-to-text) demonstrates remarkable p...
