Discrete audio representation, aka audio tokenization, has seen renewed
...
This paper presents an overview and evaluation of some of the end-to-end...
We present AmberNet, a compact end-to-end neural network for Spoken Lang...
Speaker diarization systems are challenged by a trade-off between the
te...
In this paper, we propose TitaNet, a novel neural network architecture f...
Computational modeling of naturalistic conversations in clinical applica...