Fast transcription of speech in low-resource languages

09/16/2019
by Mark Hasegawa-Johnson, et al.

We present software that, in only a few hours, transcribes forty hours of recorded speech in a surprise language, using only a few tens of megabytes of noisy text in that language and a zero-resource grapheme-to-phoneme (G2P) table. A pretrained acoustic model maps acoustic features to phonemes; a reversed G2P maps those phonemes to graphemes; a language model then maps the graphemes to a most-likely grapheme sequence, i.e., a transcription. The software has worked successfully with corpora in Arabic, Assamese, Kinyarwanda, Russian, Sinhalese, Swahili, Tagalog, and Tamil.
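The last two stages of the pipeline described in the abstract (reversed G2P, then language-model selection) can be illustrated with a toy sketch. This is not the authors' implementation: the G2P table, the training text, the character-bigram language model with add-one smoothing, and the Viterbi search are all assumptions chosen for illustration, and the phoneme sequence is assumed to come from an acoustic model that is not shown.

```python
import math

# Hypothetical toy G2P table (grapheme -> phoneme); not taken from the paper.
G2P = {"c": "k", "k": "k", "a": "a", "t": "t"}


def invert_g2p(g2p):
    """Reverse a grapheme->phoneme table into phoneme->[candidate graphemes]."""
    p2g = {}
    for grapheme, phoneme in g2p.items():
        p2g.setdefault(phoneme, []).append(grapheme)
    return p2g


def train_bigram_lm(text, alphabet):
    """Return a log-probability function for a character bigram model
    with add-one smoothing. '^' marks the start of the text."""
    counts, totals = {}, {}
    padded = "^" + text
    for prev, cur in zip(padded, padded[1:]):
        counts[(prev, cur)] = counts.get((prev, cur), 0) + 1
        totals[prev] = totals.get(prev, 0) + 1
    vocab_size = len(alphabet)

    def logp(prev, cur):
        num = counts.get((prev, cur), 0) + 1
        den = totals.get(prev, 0) + vocab_size
        return math.log(num / den)

    return logp


def decode(phonemes, p2g, logp):
    """Viterbi search: pick one grapheme per phoneme so that the
    bigram language model score of the whole string is maximal."""
    # best maps the last grapheme to (score, string) of the best path so far.
    best = {"^": (0.0, "")}
    for phoneme in phonemes:
        step = {}
        for g in p2g[phoneme]:
            step[g] = max(
                ((score + logp(prev, g), out + g)
                 for prev, (score, out) in best.items()),
                key=lambda item: item[0],
            )
        best = step
    return max(best.values(), key=lambda item: item[0])[1]


p2g = invert_g2p(G2P)
logp = train_bigram_lm("cat cat cat tack", alphabet=set(G2P))
print(decode(["k", "a", "t"], p2g, logp))  # prints "cat"
```

In this toy case the phoneme "k" could be written "c" or "k"; the language model, trained on text where "ca" is common, resolves the ambiguity in favour of "cat". A real system would need to handle multi-character graphemes, phoneme errors from the acoustic model, and a far larger language model.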


research
03/30/2022

Vakyansh: ASR Toolkit for Low Resource Indic languages

We present Vakyansh, an end to end toolkit for Speech Recognition in Ind...
research
04/29/2021

The Interspeech Zero Resource Speech Challenge 2021: Spoken language modelling

We present the Zero Resource Speech Challenge 2021, which asks participa...
research
09/26/2019

DARTS: Dialectal Arabic Transcription System

We present the speech to text transcription system, called DARTS, for lo...
research
06/26/2022

Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi

In this paper we discuss an in-progress work on the development of a spe...
research
11/02/2022

A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition

We propose a quantum kernel learning (QKL) framework to address the inhe...
research
03/23/2017

An embedded segmental K-means model for unsupervised segmentation and clustering of speech

Unsupervised segmentation and clustering of unlabelled speech are core p...
research
07/14/2023

Towards spoken dialect identification of Irish

The Irish language is rich in its diversity of dialects and accents. Thi...
