Accurate prediction of the user intent to interact with a voice assistan...
Voice trigger detection is an important task, which enables activating a...
We address the problem of detecting speech directed to a device that doe...
Deriving multimodal representations of audio and lexical inputs is a cen...
It is desirable for a text-to-speech system to take into account the env...
We present an architecture for voice trigger detection for virtual assis...
In this paper, we address the task of determining whether a given uttera...
We present an introspection of an audiovisual speech enhancement model. ...
We present progress towards bilingual Text-to-Speech which is able to tr...
Emotion plays an essential role in human-to-human communication, enablin...
Automatic speech transcription and speaker recognition are usually treat...
Millions of people reach out to digital assistants such as Siri every da...
We introduce a recurrent neural network architecture for automated road ...