Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques

05/24/2020
by   Thomas Drugman, et al.
0

This paper addresses the problem of estimating the voice source directly from speech waveforms. A novel principle based on Anticausality Dominated Regions (ACDR) is used to estimate the glottal open phase. This technique is compared to two other state-of-the-art well-known methods, namely the Zeros of the Z-Transform (ZZT) and the Iterative Adaptive Inverse Filtering (IAIF) algorithms. Decomposition quality is assessed on synthetic signals through two objective measures: the spectral distortion and a glottal formant determination rate. Technique robustness is tested by analyzing the influence of noise and Glottal Closure Instant (GCI) location errors. Besides impacts of the fundamental frequency and the first formant on the performance are evaluated. Our proposed approach shows significant improvement in robustness, which could be of a great interest when decomposing real speech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2019

A Comparative Study of Glottal Source Estimation Techniques

Source-tract decomposition (or glottal flow estimation) is one of the ba...
research
01/02/2020

On the Mutual Information between Source and Filter Contributions for Voice Pathology Detection

This paper addresses the problem of automatic detection of voice patholo...
research
05/31/2020

Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra

Maximum Voiced Frequency (MVF) is used in various speech models as the s...
research
12/28/2019

Glottal Closure and Opening Instant Detection from Speech Signals

This paper proposes a new procedure to detect Glottal Closure and Openin...
research
12/21/2017

On the Use of a Spectral Glottal Model for the Source-filter Separation of Speech

The estimation of glottal flow from a speech waveform is a key method fo...
research
05/16/2020

Glottal Source Estimation using an Automatic Chirp Decomposition

In a previous work, we showed that the glottal source can be estimated f...
research
03/07/2019

Voice Activity Detection: Merging Source and Filter-based Information

Voice Activity Detection (VAD) refers to the problem of distinguishing s...

Please sign up or login with your details

Forgot password? Click here to reset