Ammar Abbas

research

∙ 07/31/2023

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Neural text-to-speech systems are often optimized on L1/L2 losses, which...

Guangyan Zhang, et al.

research

∙ 07/13/2023

Controllable Emphasis with zero data for text-to-speech

We present a scalable method to produce high quality emphasis for text-t...

Arnaud Joly, et al.

research

∙ 06/20/2023

eCat: An End-to-End Model for Multi-Speaker TTS Many-to-Many Fine-Grained Prosody Transfer

We present eCat, a novel end-to-end multispeaker model capable of: a) ge...

Ammar Abbas, et al.

research

∙ 06/29/2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

Generating expressive and contextually appropriate prosody remains a cha...

Peter Makarov, et al.

research

∙ 06/28/2022

Expressive, Variable, and Controllable Duration Modelling in TTS

Duration modelling has become an important research problem once more wi...

Ammar Abbas, et al.

research

∙ 06/27/2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer

In this paper, we present CopyCat2 (CC2), a novel model capable of: a) s...

Sri Karlapati, et al.

research

∙ 06/29/2021

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech

We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to s...

Ammar Abbas, et al.

research

∙ 06/14/2021

A learned conditional prior for the VAE acoustic space of a TTS system

Many factors influence speech yielding different renditions of a given s...

Penny Karanasou, et al.

research

∙ 11/04/2020

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech

In this paper, we introduce Kathaka, a model trained with a novel two-st...

Sri Karlapati, et al.

research

∙ 05/06/2019

A Geometric Approach to Obtain a Bird's Eye View from an Image

The objective of this paper is to rectify any monocular image by computi...

Ammar Abbas, et al.
