Recent work has shown that it is possible to resynthesize high-quality s...
Adaptive inference is a simple method for reducing inference costs. The
...
Speech language models (SpeechLMs) process and generate acoustic data on...
The attention mechanism is considered the backbone of the widely-used
Tr...
Getting the most out of limited resources allows advances in natural lan...
In this paper we present VDTTS, a Visually-Driven Text-to-Speech model.
...