Onception: Active Learning with Expert Advice for Real World Machine Translation

by Vânia Mendonça, et al.

Active learning can play an important role in low-resource settings (i.e., where annotated data is scarce) by selecting which instances are most worth annotating. Most active learning approaches for Machine Translation assume the existence of a pool of sentences in a source language, and rely on human annotators to provide translations or post-edits, which can still be costly. In this article, we assume a real-world human-in-the-loop scenario in which: (i) the source sentences may not be readily available, but instead arrive in a stream; (ii) the automatic translations receive feedback in the form of a rating, instead of a correct/edited translation, since the human-in-the-loop might be a user who is looking for a translation but is not able to provide one. To tackle the challenge of deciding whether each incoming pair of source sentence and translations is worth querying for human feedback, we resort to a number of stream-based active learning query strategies. Moreover, since we do not know in advance which query strategy will be the most adequate for a certain language pair and set of Machine Translation models, we propose to dynamically combine multiple strategies using prediction with expert advice. Our experiments show that using active learning allows convergence to the best Machine Translation systems with fewer human interactions. Furthermore, combining multiple strategies using prediction with expert advice often outperforms several individual active learning strategies with even fewer interactions.
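To make the idea of combining query strategies through prediction with expert advice more concrete, the following is a minimal sketch using a generic exponentially weighted forecaster. It is not taken from the article: the function names, the learning rate `eta`, the 0/1 vote encoding, the decision threshold, and the way losses are assigned to strategies are all assumptions chosen for illustration.

```python
import math


def update_weights(weights, losses, eta=0.5):
    """Exponentially weighted update (illustrative, not the article's rule).

    Each strategy's weight is multiplied by exp(-eta * loss) and the
    weights are renormalized so that they sum to 1.
    """
    updated = [w * math.exp(-eta * loss) for w, loss in zip(weights, losses)]
    total = sum(updated)
    return [w / total for w in updated]


def should_query(weights, votes, threshold=0.5):
    """Weighted vote over query strategies.

    votes[i] is 1 if strategy i wants to ask for human feedback on the
    incoming source/translations pair, and 0 otherwise (assumed encoding).
    """
    score = sum(w * v for w, v in zip(weights, votes))
    return score >= threshold


# Two hypothetical strategies that start with equal weight.
weights = [0.5, 0.5]

# Assumed losses in [0, 1] for one round: strategy 0 was right, strategy 1 was wrong.
weights = update_weights(weights, losses=[0.0, 1.0])

# Strategy 0 now dominates, so its vote alone is enough to trigger a query.
ask_human = should_query(weights, votes=[1, 0])
```

In this sketch, a strategy that keeps incurring loss has less and less influence on whether the next incoming pair is sent for a human rating, which is the general behaviour expected of a prediction-with-expert-advice combiner. How the loss of each strategy is actually computed from the ratings is specific to the article and is not reproduced here.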




