Adriana Stan

research

∙ 09/11/2023

Towards generalisable and calibrated synthetic speech detection with self-supervised representations

Generalisation – the ability of a model to perform well on unseen data –...

0 Dan Oneata, et al. ∙

research

∙ 07/19/2023

An analysis on the effects of speaker embedding choice in non auto-regressive TTS

In this paper we introduce a first attempt on understanding how a non-au...

0 Adriana Stan, et al. ∙

research

∙ 02/06/2023

Residual Information in Deep Speaker Embedding Architectures

Speaker embeddings represent a means to extract representative vectorial...

0 Adriana Stan, et al. ∙

research

∙ 06/07/2022

FlexLip: A Controllable Text-to-Lip System

The task of converting text input into video content is becoming an impo...

0 Dan Oneata, et al. ∙

research

∙ 06/03/2021

An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis

Multi-speaker spoken datasets enable the creation of text-to-speech synt...

0 Beata Lorincz, et al. ∙

research

∙ 06/03/2021

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis

Building multispeaker neural network-based text-to-speech synthesis syst...

0 Beata Lorincz, et al. ∙

research

∙ 05/20/2021

Speaker disentanglement in video-to-speech conversion

The task of video-to-speech aims to translate silent video of lip moveme...

0 Dan Oneata, et al. ∙

research

∙ 01/14/2021

An evaluation of word-level confidence estimation for end-to-end automatic speech recognition

Quantifying the confidence (or conversely the uncertainty) of a predicti...

0 Dan Oneata, et al. ∙

research

∙ 09/11/2020

RECOApy: Data recording, pre-processing and phonetic transcription for end-to-end speech-based applications

Deep learning enables the development of efficient end-to-end speech pro...

0 Adriana Stan, et al. ∙

Adriana Stan

Featured Co-authors

Sign in with Google

Consider DeepAI Pro