Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation

10/24/2022
by   Evonne P. C. Lee, et al.
0

In speaker diarisation, speaker embedding extraction models often suffer from the mismatch between their training loss functions and the speaker clustering method. In this paper, we propose the method of spectral clustering-aware learning of embeddings (SCALE) to address the mismatch. Specifically, besides an angular prototype cal (AP) loss, SCALE uses a novel affinity matrix loss which directly minimises the error between the affinity matrix estimated from speaker embeddings and the reference. SCALE also includes p-percentile thresholding and Gaussian blur as two important hyper-parameters for spectral clustering in training. Experiments on the AMI dataset showed that speaker embeddings obtained with SCALE achieved over 50 reductions using oracle segmentation, and over 30 rate reductions using automatic segmentation when compared to a strong baseline with the AP-loss-based speaker embeddings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2020

Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap

In this study, we propose a new spectral clustering framework that can a...
research
07/23/2019

LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization

More and more neural network approaches have achieved considerable impro...
research
05/22/2020

Speaker diarization with session-level speaker embedding refinement using graph neural networks

Deep speaker embedding models have been commonly used as a building bloc...
research
08/18/2015

Deep clustering: Discriminative embeddings for segmentation and separation

We address the problem of acoustic source separation in a deep learning ...
research
11/05/2020

Multi-class Spectral Clustering with Overlaps for Speaker Diarization

This paper describes a method for overlap-aware speaker diarization. Giv...
research
04/06/2021

Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings

Many modern systems for speaker diarization, such as the recently-develo...
research
12/21/2015

Analysis of Vessel Connectivities in Retinal Images by Cortically Inspired Spectral Clustering

Retinal images provide early signs of diabetic retinopathy, glaucoma, an...

Please sign up or login with your details

Forgot password? Click here to reset