Survey of Low-Resource Machine Translation

09/01/2021
by Barry Haddow, et al.

We present a survey covering the state of the art in low-resource machine translation. There are currently around 7000 languages spoken in the world, and almost all language pairs lack significant resources for training machine translation models. Interest has been growing in research that addresses the challenge of producing useful translation models when very little translated training data is available. We present a high-level summary of this topical field and provide an overview of best practices.


