Cinematic audio source separation is a relatively new subtask of audio s...
We introduce the ÌròyìnSpeech corpus – a new dataset influenced
by a des...
BibleTTS is a large, high-quality, open speech dataset for ten languages...
This paper describes foundational efforts with SautiDB-Naija, a novel co...
We propose a dataset, AVASpeech-SMAD, to assist speech and music activit...
With the success of large-scale pre-training and multilingual modeling i...
We take a step towards addressing the under-representation of the Africa...
Research in NLP lacks geographic diversity, and the question of how NLP ...
Many Nigerian languages have relinquished their previous prestige and pu...
Yorùbá is a widely spoken West African language with a writing system
ri...
Africa has over 2000 languages. Despite this, African languages account ...
Traffic Pumping attacks are a form of high-volume SPAM that target telep...
In this paper, we describe recent improvements to the production Marchex...
Yorùbá is a widely spoken West African language with a writing system
ri...
For conversational large-vocabulary continuous speech recognition (LVCSR...