Experiments of ASR-based mispronunciation detection for children and adult English learners

04/13/2021
by   Nina Hosseini-Kivanani, et al.
0

Pronunciation is one of the fundamentals of language learning, and it is considered a primary factor of spoken language when it comes to an understanding and being understood by others. The persistent presence of high error rates in speech recognition domains resulting from mispronunciations motivates us to find alternative techniques for handling mispronunciations. In this study, we develop a mispronunciation assessment system that checks the pronunciation of non-native English speakers, identifies the commonly mispronounced phonemes of Italian learners of English, and presents an evaluation of the non-native pronunciation observed in phonetically annotated speech corpora. In this work, to detect mispronunciations, we used a phone-based ASR implemented using Kaldi. We used two non-native English labeled corpora; (i) a corpus of Italian adults contains 5,867 utterances from 46 speakers, and (ii) a corpus of Italian children consists of 5,268 utterances from 78 children. Our results show that the selected error model can discriminate correct sounds from incorrect sounds in both native and nonnative speech, and therefore can be used to detect pronunciation errors in non-native speech. The phone error rates show improvement in using the error language model. The ASR system shows better accuracy after applying the error model on our selected corpora.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2023

Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications

Voicebots have provided a new avenue for supporting the development of l...
research
09/06/2019

Neural Network-Based Modeling of Phonetic Durations

A deep neural network (DNN)-based model has been developed to predict no...
research
12/13/2016

Performance Improvements of Probabilistic Transcript-adapted ASR with Recurrent Neural Network and Language-specific Constraints

Mismatched transcriptions have been proposed as a mean to acquire probab...
research
10/24/2022

Proficiency assessment of L2 spoken English using wav2vec 2.0

The increasing demand for learning English as a second language has led ...
research
04/03/2021

speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

This paper introduces a new open-source speech corpus named "speechocean...
research
09/25/2018

Non-native children speech recognition through transfer learning

This work deals with non-native children's speech and investigates both ...
research
12/08/2021

A study on native American English speech recognition by Indian listeners with varying word familiarity level

In this study, listeners of varied Indian nativities are asked to listen...

Please sign up or login with your details

Forgot password? Click here to reset