Rohan Badlani | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Bhiksha Raj
84 publications
Bryan Catanzaro
61 publications
Xiaoyu Chen
47 publications
Anurag Kumar
45 publications
Emmanuel Vincent
32 publications
Wei Ping
32 publications
Ian Lane
25 publications
Benjamin Elizalde
19 publications
Kevin J. Shih
15 publications
Adrian Łańcucki
14 publications
Rafael Valle
13 publications

research

∙ 01/24/2023

Multilingual Multiaccented Multispeaker TTS with RADTTS

We work to create a multilingual speech synthesis system which can gener...

0 Rohan Badlani, et al. ∙

research

∙ 03/03/2022

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows

Despite recent advances in generative modeling for text-to-speech synthe...

0 Kevin J. Shih, et al. ∙

research

∙ 08/23/2021

One TTS Alignment To Rule Them All

Speech-to-text alignment is a critical component of neural textto-speech...

0 Rohan Badlani, et al. ∙

research

∙ 11/19/2020

Relation Extraction with Contextualized Relation Embedding (CRE)

Relation extraction is the task of identifying relation instance between...

0 Xiaoyu Chen, et al. ∙

research

∙ 01/17/2018

NELS - Never-Ending Learner of Sounds

Sounds are essential to how humans perceive and interact with the world ...

0 Benjamin Elizalde, et al. ∙

research

∙ 11/02/2017

Framework for evaluation of sound event detection in web videos

The largest source of sound events is web videos. Most videos lack sound...

0 Rohan Badlani, et al. ∙

research

∙ 10/30/2017

Content-based Representations of audio using Siamese neural networks

In this paper, we focus on the problem of content-based retrieval for au...

0 Pranay Manocha, et al. ∙

research

∙ 07/22/2016

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording

In this paper we present our work on Task 1 Acoustic Scene Classi- ficat...

0 Benjamin Elizalde, et al. ∙

Success!

An error occurred