Nithin Rao Koluguri | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Shrikanth Narayanan
93 publications
Boris Ginsburg
44 publications
Somshubra Majumdar
15 publications
Manoj Kumar
13 publications
Vahid Noroozi
11 publications
Tae Jin Park
10 publications
Jagadeesh Balam
8 publications
Kunal Dhawan
8 publications
Fei Jia
6 publications
Samuel Kriman
4 publications
Dima Rekesh
3 publications

research

∙ 09/19/2023

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition

Discrete audio representation, aka audio tokenization, has seen renewed ...

0 Krishna C. Puvvada, et al. ∙

research

∙ 09/18/2023

Investigating End-to-End ASR Architectures for Long Form Audio Transcription

This paper presents an overview and evaluation of some of the end-to-end...

0 Nithin Rao Koluguri, et al. ∙

research

∙ 10/27/2022

AmberNet: A Compact End-to-End Model for Spoken Language Identification

We present AmberNet, a compact end-to-end neural network for Spoken Lang...

0 Fei Jia, et al. ∙

research

∙ 03/30/2022

Multi-scale Speaker Diarization with Dynamic Scale Weighting

Speaker diarization systems are challenged by a trade-off between the te...

0 Tae Jin Park, et al. ∙

research

∙ 10/08/2021

TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context

In this paper, we propose TitaNet, a novel neural network architecture f...

0 Nithin Rao Koluguri, et al. ∙

research

∙ 10/24/2019

Meta-learning for robust child-adult classification from speech

Computational modeling of naturalistic conversations in clinical applica...

0 Nithin Rao Koluguri, et al. ∙

Success!

An error occurred