Trainable Greedy Decoding for Neural Machine Translation

02/08/2017
by Jiatao Gu, et al.

Recent research in neural machine translation has largely focused on two aspects: neural network architectures and end-to-end learning algorithms. The problem of decoding, however, has received relatively little attention from the research community. In this paper, we focus solely on the problem of decoding given a trained neural machine translation model. Instead of trying to build a new decoding algorithm for any specific decoding objective, we propose the idea of a trainable decoding algorithm, in which we train a decoding algorithm to find a translation that maximizes an arbitrary decoding objective. More specifically, we design an actor that observes and manipulates the hidden state of the neural machine translation decoder, and we propose to train it using a variant of deterministic policy gradient. We extensively evaluate the proposed algorithm using four language pairs and two decoding objectives, and show that we can indeed train a trainable greedy decoder that generates a better translation (in terms of a target decoding objective) with minimal computational overhead.
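The idea of an actor that observes and manipulates the decoder's hidden state during greedy decoding can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the paper's implementation: `ToyDecoder` is a hypothetical stand-in for a trained, frozen translation decoder with random weights, `LinearActor` is a hypothetical actor that adds a correction to the hidden state, and the training of the actor by deterministic policy gradient is omitted entirely.

```python
import math
import random


def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def make_matrix(rows, cols, rng, scale=0.5):
    """Random matrix with entries drawn uniformly from [-scale, scale]."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class ToyDecoder:
    """Hypothetical stand-in for a trained, frozen NMT decoder (random weights)."""

    def __init__(self, vocab_size=6, hidden_size=4, seed=0):
        rng = random.Random(seed)
        self.W = make_matrix(hidden_size, hidden_size, rng)  # recurrence
        self.E = make_matrix(vocab_size, hidden_size, rng)   # token embeddings
        self.V = make_matrix(vocab_size, hidden_size, rng)   # output projection

    def step(self, h, prev_token):
        """Compute the next hidden state from the previous state and token."""
        pre = matvec(self.W, h)
        return [math.tanh(p + e) for p, e in zip(pre, self.E[prev_token])]

    def logits(self, h):
        """Score every vocabulary item given the hidden state."""
        return matvec(self.V, h)


class LinearActor:
    """Hypothetical actor: maps the observed hidden state to an additive correction."""

    def __init__(self, hidden_size=4, scale=0.0, seed=1):
        rng = random.Random(seed)
        self.U = make_matrix(hidden_size, hidden_size, rng, scale)

    def __call__(self, h):
        return [math.tanh(u) for u in matvec(self.U, h)]


def greedy_decode(decoder, h0, actor=None, bos=0, eos=1, max_len=10):
    """Greedy decoding; if an actor is given, it adjusts the hidden state each step."""
    h, token, out = list(h0), bos, []
    for _ in range(max_len):
        h = decoder.step(h, token)
        if actor is not None:
            # The actor observes h and manipulates it before the token is chosen.
            h = [a + b for a, b in zip(h, actor(h))]
        scores = decoder.logits(h)
        token = max(range(len(scores)), key=scores.__getitem__)  # argmax
        if token == eos:
            break
        out.append(token)
    return out
```

In this sketch the decoder's own weights are never changed; only the actor's parameters would be trained, which is consistent with the abstract's setting of decoding from an already trained model. An actor whose output is zero leaves plain greedy decoding unchanged.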

research
06/22/2017

Neural Machine Translation with Gumbel-Greedy Decoding

Previous neural machine translation models used some heuristic search al...
research
05/12/2016

Noisy Parallel Approximate Decoding for Conditional Recurrent Language Model

Recent advances in conditional recurrent language modelling have mainly ...
research
04/21/2018

A Stable and Effective Learning Strategy for Trainable Greedy Decoding

As a widely used approximate search strategy for neural network decoders...
research
06/03/2019

Training Neural Machine Translation To Apply Terminology Constraints

This paper proposes a novel method to inject custom terminology into neu...
research
01/11/2017

Towards Decoding as Continuous Optimization in Neural Machine Translation

We propose a novel decoding approach for neural machine translation (NMT...
research
07/18/2022

MAD for Robust Reinforcement Learning in Machine Translation

We introduce a new distributed policy gradient algorithm and show that i...
research
08/30/2021

Scheduled Sampling Based on Decoding Steps for Neural Machine Translation

Scheduled sampling is widely used to mitigate the exposure bias problem ...