Evaluating Factuality in Text Simplification

04/15/2022
by Ashwin Devaraj, et al.

Automated simplification models aim to make input texts more readable. Such methods have the potential to make complex information accessible to a wider audience, e.g., providing access to recent medical literature that might otherwise be impenetrable for a lay reader. However, such models risk introducing errors into automatically simplified texts, for instance by inserting statements unsupported by the corresponding original text, or by omitting key information. Providing more readable but inaccurate versions of texts may in many cases be worse than providing no such access at all. The problem of factual accuracy (and the lack thereof) has received heightened attention in the context of summarization models, but the factuality of automatically simplified texts has not been investigated. We introduce a taxonomy of errors that we use to analyze both references drawn from standard simplification datasets and state-of-the-art model outputs. We find that both often contain errors that existing evaluation metrics do not capture, motivating research into ensuring the factual accuracy of automated simplification models.

