Fast Neural Machine Translation Implementation

05/24/2018
by   Hieu Hoang, et al.
0

This paper describes the submissions to the efficiency track for GPUs by members of the University of Edinburgh, Adam Mickiewicz University, Tilde and University of Alicante. We focus on efficient implementation of the recurrent deep-learning model as implemented in Amun, the fast inference engine for neural machine translation. We improve the performance with an efficient mini-batching algorithm and by fusing the softmax operation with k-best extraction algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2018

Tensor2Tensor for Neural Machine Translation

Tensor2Tensor is a library for deep learning models that is well-suited ...
research
04/01/2018

Marian: Fast Neural Machine Translation in C++

We present Marian, an efficient and self-contained Neural Machine Transl...
research
09/22/2017

Neural Machine Translation

Draft of textbook chapter on neural machine translation. a comprehensive...
research
07/20/2020

Neural Machine Translation model for University Email Application

Machine translation has many applications such as news translation, emai...
research
09/09/2018

Speeding Up Neural Machine Translation Decoding by Cube Pruning

Although neural machine translation has achieved promising results, it s...
research
10/19/2018

Impact of Corpora Quality on Neural Machine Translation

Large parallel corpora that are automatically obtained from the web, doc...
research
06/29/2019

The CUED's Grammatical Error Correction Systems for BEA-2019

We describe two entries from the Cambridge University Engineering Depart...

Please sign up or login with your details

Forgot password? Click here to reset