A Multilingual Keyword Spotting (KWS) system detects spokenkeywords over...
In this work we propose a novel token-based training strategy that impro...
In this paper, we present a novel approach to adapt a sequence-to-sequen...
While recent research advances in speaker diarization mostly focus on
im...
We trained a keyword spotting model using federated learning on real use...
This paper presents a novel study of parameter-free attentive scoring fo...
In this paper, we introduce a novel language identification system based...
In this paper, we present a novel speaker diarization system for streami...
We propose self-training with noisy student-teacher approach for streami...
In this paper, we describe SpeakerStew - a hybrid system to perform spea...
Many neural network speaker recognition systems model each speaker using...
We introduce VoiceFilter-Lite, a single-channel source separation model ...
This paper discusses one of the most challenging practical engineering
p...
We demonstrate that a production-quality keyword-spotting model can be
t...
Google's multilingual speech recognition system combines low-level acous...
In this paper, we propose "personal VAD", a system to detect the voice
a...
In many scenarios of a language identification task, the user will speci...
In this paper, we present a novel system that separates the voice of a t...
We describe a neural network-based system for text-to-speech (TTS) synth...
We present a novel algorithm, called Links, designed to perform online
c...
Attention-based models have recently shown great performance on a range ...
For many years, i-vector based speaker embedding techniques were the dom...
In this paper, we propose a new loss function called generalized end-to-...