Prak: An automatic phonetic alignment tool for Czech

04/17/2023
by   Václav Hanžl, et al.
0

Labeling speech down to the identity and time boundaries of phones is a labor-intensive part of phonetic research. To simplify this work, we created a free open-source tool generating phone sequences from Czech text and time-aligning them with audio. Low architecture complexity makes the design approachable for students of phonetics. Acoustic model ReLU NN with 56k weights was trained using PyTorch on small CommonVoice data. Alignment and variant selection decoder is implemented in Python with matrix library. A Czech pronunciation generator is composed of simple rule-based blocks capturing the logic of the language where possible, allowing modification of transcription approach details. Compared to tools used until now, data preparation efficiency improved, the tool is usable on Mac, Linux and Windows in Praat GUI or command line, achieves mostly correct pronunciation variant choice including glottal stop detection, algorithmically captures most of Czech assimilation logic and is both didactic and practical.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2022

A machine transliteration tool between Uzbek alphabets

Machine transliteration, as defined in this paper, is a process of autom...
research
10/08/2021

Phone-to-audio alignment without text: A Semi-supervised Approach

The task of phone-to-audio alignment has many applications in speech res...
research
10/19/2020

PySBD: Pragmatic Sentence Boundary Disambiguation

In this paper, we present a rule-based sentence boundary disambiguation ...
research
10/28/2022

System Demo: Tool and Infrastructure for Offensive Language Error Analysis (OLEA) in English

The automatic detection of offensive language is a pressing societal nee...
research
10/30/2018

An architecture of open-source tools to combine textual information extraction, faceted search and information visualisation

This article presents our steps to integrate complex and partly unstruct...
research
12/25/2021

Multi-Dialect Arabic Speech Recognition

This paper presents the design and development of multi-dialect automati...
research
08/14/2023

Computer Aided Design and Grading for an Electronic Functional Programming Exam

Electronic exams (e-exams) have the potential to substantially reduce th...

Please sign up or login with your details

Forgot password? Click here to reset