Manual Post-editing of Automatically Transcribed Speeches from the Icelandic Parliament - Althingi

07/31/2018
by   Judy Y. Fong, et al.
0

The design objectives for an automatic transcription system are to produce text readable by humans and to minimize the impact on manual post-editing. This study reports on a recognition system used for transcribing speeches in the Icelandic parliament - Althingi. It evaluates the system performance and its effect on manual post-editing. The results are compared against the original manual transcription process. 239 total speeches, consisting of 11 hours and 33 minutes, were processed, both manually and automatically, and the editing process was analysed. The dependence of word edit distance on edit time and the editing real-time factor has been estimated and compared to user evaluations of the transcription system. The main findings show that the word edit distance is positively correlated with edit time and a system achieving a 12.6 distance would match the performance of manual transcribers. Producing perfect transcriptions would result in a real-time factor of 2.56. The study also shows that 99 subjective evaluations. On the contrary, 21 received a bad grade.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2017

Post-edit Analysis of Collective Biography Generation

Text generation is increasingly common but often requires manual post-ed...
research
04/09/2021

A preliminary study on evaluating Consultation Notes with Post-Editing

Automatic summarisation has the potential to aid physicians in streamlin...
research
07/24/2019

Translator2Vec: Understanding and Representing Human Post-Editors

The combination of machines and humans for translation is effective, wit...
research
09/20/2021

Latexify Math: Mathematical Formula Markup Revision to Assist Collaborative Editing in Math Q A Sites

Collaborative editing questions and answers plays an important role in q...
research
07/20/2022

Explicit Image Caption Editing

Given an image and a reference caption, the image caption editing task a...
research
05/11/2022

SubER: A Metric for Automatic Evaluation of Subtitle Quality

This paper addresses the problem of evaluating the quality of automatica...
research
11/21/2020

Iterative Text-based Editing of Talking-heads Using Neural Retargeting

We present a text-based tool for editing talking-head video that enables...

Please sign up or login with your details

Forgot password? Click here to reset