Kentaro Tachibana

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Hiroshi Saruwatari
76 publications
Shinnosuke Takamichi
50 publications
Yuki Saito
22 publications
Tatsuya Komatsu
20 publications
Ryuichi Yamamoto
19 publications
Takaaki Saeki
18 publications
Eunwoo Song
13 publications
Yusuke Uchida
11 publications
Jae-Min Kim
11 publications
Tianqi Li
8 publications
Hyun-Wook Yoon
6 publications

research

∙ 09/15/2023

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions

We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis sy...

0 Reo Shimizu, et al. ∙

research

∙ 05/23/2023

ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings

We propose ChatGPT-EDSS, an empathetic dialogue speech synthesis (EDSS) ...

0 Yuki Saito, et al. ∙

research

∙ 05/23/2023

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center

We present CALLS, a Japanese speech corpus that considers phone calls in...

0 Yuki Saito, et al. ∙

research

∙ 10/28/2022

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

We propose a lightweight end-to-end text-to-speech model using multi-ban...

0 Masaya Kawamura, et al. ∙

research

∙ 10/28/2022

Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis

Several fully end-to-end text-to-speech (TTS) models have been proposed ...

0 Yuma Shirahata, et al. ∙

research

∙ 10/28/2022

Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs

Neural audio super-resolution models are typically trained on low- and h...

0 Reo Yoneyama, et al. ∙

research

∙ 06/16/2022

Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History

We propose an end-to-end empathetic dialogue speech synthesis (DSS) mode...

0 Yuto Nishimura, et al. ∙

research

∙ 04/21/2022

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

Data augmentation via voice conversion (VC) has been successfully applie...

0 Ryo Terashima, et al. ∙

research

∙ 03/29/2022

DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning

Most text-to-speech (TTS) methods use high-quality speech corpora record...

0 Takaaki Saeki, et al. ∙

research

∙ 03/28/2022

STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent

We present STUDIES, a new speech corpus for developing a voice agent tha...

0 Yuki Saito, et al. ∙

research

∙ 04/26/2021

Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis

We propose a novel phrase break prediction method that combines implicit...

0 Kosuke Futamata, et al. ∙

research

∙ 09/06/2018

Full-body High-resolution Anime Generation with Progressive Structure-conditional Generative Adversarial Networks

We propose Progressive Structure-conditional Generative Adversarial Netw...

0 Koichi Hamada, et al. ∙

Success!

An error occurred

Kentaro Tachibana

Featured Co-authors

Sign in with Google

Consider DeepAI Pro