In recent years, Automatic Speech Recognition (ASR) technology has appro...
This paper presents an edge-based defocus blur estimation method from a...
Sequence to Sequence models, in particular the Transformer, achieve stat...
The audio-visual speech fusion strategy AV Align has shown significant p...
The timings of spoken response offsets in human dialogue have been shown...
Audio-Visual Speech Recognition (AVSR) seeks to model, and thereby explo...
Automatic speech recognition can potentially benefit from the lip motion...
In human conversational interactions, turn-taking exchanges can be coord...
For spoken dialog systems to conduct fluid conversational interactions w...
Finding visual features and suitable models for lipreading tasks that ar...