A Recipe for Arabic-English Neural Machine Translation

08/18/2018
by   Abdullah Alrajeh, et al.

In this paper, we present a recipe for building a good Arabic-English neural machine translation system. We compare neural systems with traditional phrase-based systems using various parallel corpora, including UN, ISI and Ummah. We also investigate the importance of special preprocessing of the Arabic script. The results are reported on the NIST MT 2005 and 2012 test sets. The best neural system gains +13 BLEU points over an equivalent simple phrase-based system on the NIST MT12 test set. Unexpectedly, we find that tuning a model trained on the whole data with a small, high-quality corpus such as Ummah gives a substantial improvement (+3 BLEU points). We also find that a neural system trained on a small Arabic-English corpus is competitive with a traditional phrase-based system.
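The abstract does not say which preprocessing steps are applied to the Arabic script, so the following is only a minimal sketch of orthographic normalization as it is commonly done in Arabic NLP: stripping short-vowel diacritics, removing the tatweel elongation character, and unifying alef variants. The function name `normalize_arabic` and the choice of steps are assumptions for illustration, not the method of the paper.

```python
import re

# Assumption: these steps are common Arabic normalization choices,
# not necessarily the preprocessing used in the paper.

# Tanween, short vowels, shadda, sukun (U+064B..U+0652) and superscript alef (U+0670).
DIACRITICS = re.compile("[\u064B-\u0652\u0670]")
# Tatweel (kashida), a purely typographic elongation character.
TATWEEL = "\u0640"
# Alef with madda above, hamza above, and hamza below.
ALEF_VARIANTS = re.compile("[\u0622\u0623\u0625]")
BARE_ALEF = "\u0627"


def normalize_arabic(text: str) -> str:
    """Return text with diacritics and tatweel removed and alef forms unified."""
    text = DIACRITICS.sub("", text)
    text = text.replace(TATWEEL, "")
    text = ALEF_VARIANTS.sub(BARE_ALEF, text)
    # Collapse runs of whitespace left over from the source text.
    return " ".join(text.split())


if __name__ == "__main__":
    # Alef-with-hamza + fatha + kaf + lam, written with escapes for clarity.
    sample = "\u0623\u064E\u0643\u0644"
    print(normalize_arabic(sample))
```

Normalization of this kind reduces the number of distinct surface forms for the same word, which shrinks the vocabulary the translation model has to learn. Whether it helps in a given setup is an empirical question.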

research
06/08/2016

First Result on Arabic Neural Machine Translation

Neural machine translation has become a major alternative to widely used...
research
01/14/2017

QCRI Machine Translation Systems for IWSLT 16

This paper describes QCRI's machine translation systems for the IWSLT 20...
research
12/18/2017

Low Resourced Machine Translation via Morpho-syntactic Modeling: The Case of Dialectal Arabic

We present the second ever evaluated Arabic dialect-to-dialect machine t...
research
06/18/2016

Egyptian Arabic to English Statistical Machine Translation System for NIST OpenMT'2015

The paper describes the Egyptian Arabic-to-English statistical machine t...
research
10/26/2020

Is it Great or Terrible? Preserving Sentiment in Neural Machine Translation of Arabic Reviews

Since the advent of Neural Machine Translation (NMT) approaches there ha...
research
02/12/2015

A new hybrid metric for verifying parallel corpora of Arabic-English

This paper discusses a new metric that has been applied to verify the qu...
research
10/15/2015

Telemedicine as a special case of Machine Translation

Machine translation is evolving quite rapidly in terms of quality. Nowad...
