Phrase-level Active Learning for Neural Machine Translation

by   Junjie Hu, et al.

Neural machine translation (NMT) is sensitive to domain shift. In this paper, we address this problem in an active learning setting where we can spend a given budget on translating in-domain data, and gradually fine-tune a pre-trained out-of-domain NMT model on the newly translated data. Existing active learning methods for NMT usually select sentences based on uncertainty scores, but these methods require costly translation of full sentences even when only one or two key phrases within the sentence are informative. To address this limitation, we re-examine previous work from the phrase-based machine translation (PBMT) era that selected not full sentences, but rather individual phrases. However, while incorporating these phrases into PBMT systems was relatively simple, it is less trivial for NMT systems, which need to be trained on full sequences to capture larger structural properties of sentences unique to the new domain. To overcome these hurdles, we propose to select both full sentences and individual phrases from unlabelled data in the new domain for routing to human translators. In a German-English translation task, our active learning approach achieves consistent improvements over uncertainty-based sentence selection methods, improving up to 1.2 BLEU score over strong active learning baselines.


page 1

page 2

page 3

page 4


Towards Neural Phrase-based Machine Translation

In this paper, we present Neural Phrase-based Machine Translation (NPMT)...

COMET-QE and Active Learning for Low-Resource Machine Translation

Active learning aims to deliver maximum benefit when resources are scarc...

Domain Adaptation of Neural Machine Translation by Lexicon Induction

It has been previously noted that neural machine translation (NMT) is ve...

Recognizing Extended Spatiotemporal Expressions by Actively Trained Average Perceptron Ensembles

Precise geocoding and time normalization for text requires that location...

Neural Machine Translation Model with a Large Vocabulary Selected by Branching Entropy

Neural machine translation (NMT), a new approach to machine translation,...

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation

Although the problem of hallucinations in neural machine translation (NM...

Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation

We explore how to improve machine translation systems by adding more tra...

Please sign up or login with your details

Forgot password? Click here to reset