Mike Seltzer | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Ke Li
159 publications
Hang Su
109 publications
David Zhang
58 publications
Yun Wang
45 publications
Vikas Chandra
43 publications
Wenhan Xiong
39 publications
Yongqiang Wang
34 publications
Xiaohui Zhang
33 publications
Ozlem Kalinli
31 publications
Yangyang Shi
28 publications
Duc Le
28 publications

research

∙ 09/05/2023

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

Automatic Speech Recognition (ASR) models need to be optimized for speci...

0 Yuan Shangguan, et al. ∙

research

∙ 07/21/2023

Prompting Large Language Models with Speech Recognition Abilities

Large language models have proven themselves highly flexible, able to so...

0 Yassir Fathullah, et al. ∙

research

∙ 05/21/2023

Multi-Head State Space Model for Speech Recognition

State space models (SSMs) have recently shown promising results on small...

0 Yassir Fathullah, et al. ∙

research

∙ 10/25/2022

Dynamic Speech Endpoint Detection with Regression Targets

Interactive voice assistants have been widely used as input interfaces i...

0 Dawei Liang, et al. ∙

research

∙ 10/07/2021

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

This paper improves the streaming transformer transducer for speech reco...

0 Yangyang Shi, et al. ∙

research

∙ 10/07/2021

Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study

Detection of common events and scenes from audio is useful for extractin...

0 Dawei Liang, et al. ∙

research

∙ 07/09/2021

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models

Hybrid automatic speech recognition (ASR) models are typically sequentia...

0 Xiaohui Zhang, et al. ∙

research

∙ 10/21/2020

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition

This paper proposes an efficient memory transformer Emformer for low lat...

0 Yangyang Shi, et al. ∙

Success!

An error occurred