Détection de locuteurs dans les séries TV

12/18/2018
by   Xavier Bost, et al.
0

Speaker diarization of audio streams turns out to be particularly challenging when applied to fictional films, where many characters talk in various acoustic conditions (background music, sound effects, variations in intonation...). Despite this acoustic variability, such movies exhibit specific visual patterns, particularly within dialogue scenes. In this paper, we introduce a two-step method to achieve speaker diarization in TV series: speaker diarization is first performed locally within scenes visually identified as dialogues; then, the hypothesized local speakers are compared to each other during a second clustering process in order to detect recurring speakers: this second stage of clustering is subject to the constraint that the different speakers involved in the same dialogue have to be assigned to different clusters. The performances of our approach are compared to those obtained by standard speaker diarization tools applied to the same data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/18/2018

Constrained speaker diarization of TV series based on visual patterns

Speaker diarization, usually denoted as the 'who spoke when' task, turns...
research
12/18/2018

Audiovisual speaker diarization of TV series

Speaker diarization may be difficult to achieve when applied to narrativ...
research
03/30/2022

Using Active Speaker Faces for Diarization in TV shows

Speaker diarization is one of the critical components of computational m...
research
03/11/2018

Path of Vowel Raising in Chengdu Dialect of Mandarin

He and Rao (2013) reported a raising phenomenon of /a/ in /Xan/ (X being...
research
08/04/2023

Speaker Diarization of Scripted Audiovisual Content

The media localization industry usually requires a verbatim script of th...
research
02/17/2020

Serial Speakers: a Dataset of TV Series

For over a decade, TV series have been drawing increasing interest, both...
research
08/25/2011

Une analyse basée sur la S-DRT pour la modélisation de dialogues pathologiques

In this article, we present a corpus of dialogues between a schizophreni...

Please sign up or login with your details

Forgot password? Click here to reset