Toward More Effective Human Evaluation for Machine Translation

04/11/2022
by   Belen Saldias, et al.
0

Improvements in text generation technologies such as machine translation have necessitated more costly and time-consuming human evaluation procedures to ensure an accurate signal. We investigate a simple way to reduce cost by reducing the number of text segments that must be annotated in order to accurately predict a score for a complete test set. Using a sampling approach, we demonstrate that information from document membership and automatic metrics can help improve estimates compared to a pure random sampling baseline. We achieve gains of up to 20 sampling and control variates. Our techniques can improve estimates made from a fixed annotation budget, are easy to implement, and can be applied to any problem with structure similar to the one we study.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2021

Automatic Classification of Human Translation and Machine Translation: A Study from the Perspective of Lexical Diversity

By using a trigram model and fine-tuning a pretrained BERT model for seq...
research
06/06/2023

Correction of Errors in Preference Ratings from Automated Metrics for Text Generation

A major challenge in the field of Text Generation is evaluation: Human e...
research
04/30/2020

NUBIA: NeUral Based Interchangeability Assessor for Text Generation

We present NUBIA, a methodology to build automatic evaluation metrics fo...
research
05/11/2023

Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation

Subword segmenters like BPE operate as a preprocessing step in neural ma...
research
08/11/2020

A parallel evaluation data set of software documentation with document structure annotation

This paper accompanies the software documentation data set for machine t...
research
06/29/2023

Joint Level Generation and Translation Using Gameplay Videos

Procedural Content Generation via Machine Learning (PCGML) faces a signi...
research
12/18/2022

Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data

As more and more conversational and translation systems are deployed in ...

Please sign up or login with your details

Forgot password? Click here to reset