Nowadays, it is common for people to take photographs of every beverage,...
We present an expanded version of our previously released Kazakh
text-to...
We present the development of a dataset for Kazakh named entity recognit...
In this paper, we study an approach to multimodal person verification us...
We study training a single end-to-end (E2E) automatic speech recognition...
We present a freely available speech corpus for the Uzbek language and r...
Alzheimer's disease (AD) is a progressive brain disorder that causes mem...
Ideally, accurate sensor measurements are needed to achieve a good
perfo...
This paper introduces a high-quality open-source speech synthesis datase...
We present SpeakingFaces as a publicly-available large-scale multimodal
...
We present an open-source speech corpus for the Kazakh language. The Kaz...
In this work, we present an end-to-end deep learning framework for X-ray...
The sense of touch is essential for reliable mapping between the environ...