Dynamic Past and Future for Neural Machine Translation

04/21/2019
by   Zaixiang Zheng, et al.
0

Previous studies have shown that neural machine translation (NMT) models can benefit from modeling translated (Past) and un-translated (Future) source contents as recurrent states (Zheng et al., 2018). However, the recurrent process is less interpretable. In this paper, we propose to model Past and Future by Capsule Network (Hinton et al.,2011), which provides an explicit separation of source words into groups of Past and Future by the process of parts-to-wholes assignment. The assignment is learned with a novel variant of routing-by-agreement mechanism (Sabour et al., 2017), namely Guided Dynamic Routing, in which what to translate at current decoding step guides the routing process to assign each source word to its associated group represented by a capsule, and to refine the representation of the capsule dynamically and iteratively. Experiments on translation tasks of three language pairs show that our model achieves substantial improvements over both RNMT and Transformer. Extensive analysis further verifies that our method does recognize translated and untranslated content as expected, and produces better and more adequate translations.

READ FULL TEXT

page 7

page 12

research
11/27/2017

Modeling Past and Future for Neural Machine Translation

Existing neural machine translation systems do not explicitly model what...
research
09/02/2018

Future-Prediction-Based Model for Neural Machine Translation

We propose a novel model for Neural Machine Translation (NMT). Different...
research
10/17/2016

Neural Machine Translation Advised by Statistical Machine Translation

Neural Machine Translation (NMT) is a new approach to machine translatio...
research
07/18/2016

Neural Machine Translation with Recurrent Attention Modeling

Knowing which words have been attended to in previous time steps while g...
research
06/11/2022

Can the Language of the Collation be Translated into the Language of the Stemma? Using Machine Translation for Witness Localization

Stemmatology is a subfield of philology where one approach to understand...
research
07/01/2019

Avoiding Implementation Pitfalls of "Matrix Capsules with EM Routing" by Hinton et al

The recent progress on capsule networks by Hinton et al. has generated c...
research
08/31/2019

Improving Multi-Head Attention with Capsule Networks

Multi-head attention advances neural machine translation by working out ...

Please sign up or login with your details

Forgot password? Click here to reset