Ignacio Lopez Moreno

research

∙ 02/25/2023

Locale Encoding For Scalable Multilingual Keyword Spotting Models

A Multilingual Keyword Spotting (KWS) system detects spokenkeywords over...

0 Pai Zhu, et al. ∙

research

∙ 11/11/2022

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

In this work we propose a novel token-based training strategy that impro...

0 Guanlong Zhao, et al. ∙

research

∙ 11/11/2022

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting

In this paper, we present a novel approach to adapt a sequence-to-sequen...

0 Beltrán Labrador, et al. ∙

research

∙ 10/25/2022

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

While recent research advances in speaker diarization mostly focus on im...

0 Quan Wang, et al. ∙

research

∙ 04/11/2022

Production federated keyword spotting via distillation, filtering, and joint federated-centralized training

We trained a keyword spotting model using federated learning on real use...

0 Andrew Hard, et al. ∙

research

∙ 03/10/2022

Parameter-Free Attentive Scoring for Speaker Verification

This paper presents a novel study of parameter-free attentive scoring fo...

0 Jason Pelecanos, et al. ∙

research

∙ 02/24/2022

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

In this paper, we introduce a novel language identification system based...

0 Quan Wang, et al. ∙

research

∙ 09/23/2021

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

In this paper, we present a novel speaker diarization system for streami...

0 Wei Xia, et al. ∙

research

∙ 06/03/2021

Noisy student-teacher training for robust keyword spotting

We propose self-training with noisy student-teacher approach for streami...

0 Hyun-Jin Park, et al. ∙

research

∙ 04/05/2021

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System

In this paper, we describe SpeakerStew - a hybrid system to perform spea...

0 Roza Chojnacka, et al. ∙

research

∙ 04/05/2021

Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition

Many neural network speaker recognition systems model each speaker using...

0 Jason Pelecanos, et al. ∙

research

∙ 09/09/2020

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

We introduce VoiceFilter-Lite, a single-channel source separation model ...

3 Quan Wang, et al. ∙

research

∙ 07/23/2020

Version Control of Speaker Recognition Systems

This paper discusses one of the most challenging practical engineering p...

0 Quan Wang, et al. ∙

research

∙ 05/21/2020

Training Keyword Spotting Models on Non-IID Data with Federated Learning

We demonstrate that a production-quality keyword-spotting model can be t...

0 Andrew Hard, et al. ∙

research

∙ 10/21/2019

Signal Combination for Language Identification

Google's multilingual speech recognition system combines low-level acous...

0 Shengye Wang, et al. ∙

research

∙ 08/12/2019

Personal VAD: Speaker-Conditioned Voice Activity Detection

In this paper, we propose "personal VAD", a system to detect the voice a...

0 Shaojin Ding, et al. ∙

research

∙ 11/29/2018

Tuplemax Loss for Language Identification

In many scenarios of a language identification task, the user will speci...

0 Li Wan, et al. ∙

research

∙ 10/11/2018

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

In this paper, we present a novel system that separates the voice of a t...

0 Quan Wang, et al. ∙

research

∙ 06/12/2018

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

We describe a neural network-based system for text-to-speech (TTS) synth...

0 Ye Jia, et al. ∙

research

∙ 01/30/2018

Links: A High-Dimensional Online Clustering Method

We present a novel algorithm, called Links, designed to perform online c...

0 Philip Andrew Mansfield, et al. ∙

research

∙ 10/28/2017

Attention-Based Models for Text-Dependent Speaker Verification

Attention-based models have recently shown great performance on a range ...

0 F A Rezaur Rahman Chowdhury, et al. ∙

research

∙ 10/28/2017

Speaker Diarization with LSTM

For many years, i-vector based speaker embedding techniques were the dom...

0 Quan Wang, et al. ∙

research

∙ 10/28/2017

Generalized End-to-End Loss for Speaker Verification

In this paper, we propose a new loss function called generalized end-to-...

0 Li Wan, et al. ∙

Ignacio Lopez Moreno

Featured Co-authors

Sign in with Google

Consider DeepAI Pro