Mateusz Łajszczak

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Thomas Drugman
59 publications
Jaime Lorenzo-Trueba
29 publications
Ray Li
22 publications
Roberto Barra-Chicote
22 publications
Alexis Moinet
19 publications
Daniel Korzekwa
14 publications
Srikanth Ronanki
14 publications
Antonio Bonafonte
13 publications
Thomas Merritt
11 publications
Bajibabu Bollepalli
11 publications
Arnaud Joly
11 publications

research

∙ 07/13/2023

Controllable Emphasis with zero data for text-to-speech

We present a scalable method to produce high quality emphasis for text-t...

0 Arnaud Joly, et al. ∙

research

∙ 06/29/2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

Generating expressive and contextually appropriate prosody remains a cha...

0 Peter Makarov, et al. ∙

research

∙ 06/27/2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer

In this paper, we present CopyCat2 (CC2), a novel model capable of: a) s...

0 Sri Karlapati, et al. ∙

research

∙ 02/13/2022

Distribution augmentation for low-resource expressive text-to-speech

This paper presents a novel data augmentation technique for text-to-spee...

0 Mateusz Łajszczak, et al. ∙

research

∙ 10/24/2021

Discrete acoustic space for an efficient sampling in neural text-to-speech

We present an SVQ-VAE architecture using a split vector quantizer for NT...

0 Marek Strelec, et al. ∙

research

∙ 07/10/2019

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

This paper proposed a novel approach for the detection and reconstructio...

0 Daniel Korzekwa, et al. ∙

research

∙ 04/04/2019

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Neural text-to-speech synthesis (NTTS) models have shown significant pro...

0 Nishant Prateek, et al. ∙

Success!

An error occurred

Mateusz Łajszczak

Featured Co-authors

Controllable Emphasis with zero data for text-to-speech

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer

Distribution augmentation for low-resource expressive text-to-speech

Discrete acoustic space for an efficient sampling in neural text-to-speech

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Sign in with Google

Consider DeepAI Pro