A new hybrid metric for verifying parallel corpora of Arabic-English

02/12/2015
by Saad Alkahtani, et al.

This paper presents a new metric for verifying the translation quality of sentence pairs in Arabic-English parallel corpora. The metric combines two techniques, one based on sentence length and the other on compression code length. Experiments on sample Arabic-English parallel test corpora indicate that combining the two techniques identifies satisfactory and unsatisfactory sentence pairs more accurately than either sentence length or compression code length alone. The proposed method is effective at filtering noise and reducing mistranslations, which greatly improves corpus quality.
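
To make the idea of a hybrid check concrete, here is a minimal sketch of how a sentence-length signal and a compression-code-length signal might be combined to flag sentence pairs. It is an illustration only, not the authors' method: the abstract does not say which compressor, which ratios, or which combination rule the paper uses. The sketch assumes zlib as a stand-in compressor, a shorter-to-longer ratio for both signals, a simple "both must pass" rule, and hypothetical thresholds (`min_len_ratio`, `min_code_ratio`).

```python
import zlib


def length_ratio(src: str, tgt: str) -> float:
    """Shorter-to-longer character length ratio; 1.0 means equal length."""
    a, b = len(src), len(tgt)
    if a == 0 or b == 0:
        return 0.0
    return min(a, b) / max(a, b)


def code_length(text: str) -> int:
    """Compressed size in bytes of the UTF-8 text.

    Assumption: zlib is used here only as a convenient stand-in for
    whatever compression scheme the paper actually uses.
    """
    return len(zlib.compress(text.encode("utf-8"), 9))


def code_length_ratio(src: str, tgt: str) -> float:
    """Shorter-to-longer ratio of the two compressed sizes."""
    a, b = code_length(src), code_length(tgt)
    return min(a, b) / max(a, b)


def is_satisfactory(
    src: str,
    tgt: str,
    min_len_ratio: float = 0.5,   # hypothetical threshold
    min_code_ratio: float = 0.5,  # hypothetical threshold
) -> bool:
    """Accept a pair only if both signals pass (an assumed combination rule)."""
    return (
        length_ratio(src, tgt) >= min_len_ratio
        and code_length_ratio(src, tgt) >= min_code_ratio
    )
```

Note that zlib adds a fixed overhead of several bytes per string, so compressed-size ratios for very short sentences are pushed toward 1.0; any real thresholds would need to be tuned on labeled sentence pairs.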

research
02/01/2019

Human acceptability judgements for extractive sentence compression

Recent approaches to English-language sentence compression rely on paral...
research
12/26/2019

Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation

Lectures translation is a case of spoken language translation and there ...
research
07/13/2018

Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations

Image classification is an ongoing research challenge. Most of the avail...
research
08/18/2018

A Recipe for Arabic-English Neural Machine Translation

In this paper, we present a recipe for building a good Arabic-English ne...
research
12/01/2020

Extracting Synonyms from Bilingual Dictionaries

We present our progress in developing a novel algorithm to extract synon...
research
09/11/2015

A Parallel Corpus of Translationese

We describe a set of bilingual English--French and English--German paral...
research
05/20/2022

Current Trends and Approaches in Synonyms Extraction: Potential Adaptation to Arabic

Extracting synonyms from dictionaries or corpora is gaining special atte...
