Large language models (LLMs) implicitly learn to perform a range of lang...
Figurative language permeates human communication, but at the same time ...
In this paper, we present MasakhaPOS, the largest part-of-speech (POS)
d...
Developing Automatic Speech Recognition (ASR) for low-resource languages...
BibleTTS is a large, high-quality, open speech dataset for ten languages...
Modern speech synthesis techniques can produce natural-sounding speech g...
Despite the progress in machine translation quality estimation and evalu...
We take a step towards addressing the under-representation of the Africa...
Research in NLP lacks geographic diversity, and the question of how NLP ...