What does it take to create the Babel Fish, a tool that can help individ...
Driven by the goal of eradicating language barriers on a global scale,
m...
A multilingual tokenizer is a fundamental component of multilingual neur...
One of the biggest challenges hindering progress in low-resource and
mul...
Fact checking at scale is difficult – while the number of active fact
ch...
Existing work in translation demonstrated the potential of massively
mul...
We show that margin-based bitext mining in a multilingual sentence space...
This paper shows that pretraining multilingual language models at scale ...
Pre-training text representations have led to significant improvements i...
This paper describes Facebook AI's submission to the WAT 2019 Myanmar-En...
We introduce Trans-gram, a simple and computationally-efficient method t...