In this study, we propose a method for jointly learning of images and vi...
In this paper we propose an extension of the Attention Branch Network (A...
In the design of action recognition models, the quality of videos in the...
In this paper, we propose a multi-domain learning model for action
recog...
We propose Multi-head Self/Cross-Attention (MSCA), which introduces a
te...
In this paper, we propose a data augmentation method for action recognit...
In this paper we propose a novel method which leverages the uncertainty
...
We propose a novel approach to identify the difficulty of visual questio...
Visual question answering (VQA) is a task of answering a visual question...
We propose an easy-to-use non-overlapping camera calibration method. Fir...
Current video captioning approaches often suffer from problems of missin...
An efficient inverse reinforcement learning for generating trajectories ...
In this paper, we propose a method for semantic segmentation of pedestri...
In this paper we propose a novel deep learning-based algorithm for biome...
Path prediction is a fundamental task for estimating how pedestrians or
...
In many cases, such as trajectories clustering and classification, we of...
We propose a feature for action recognition called Trajectory-Set (TS), ...
Colorectal endoscopy is important for the early detection and treatment ...
This paper proposes a method for domain adaptation that extends the maxi...
In this paper we propose a method for transfer learning of endoscopic im...
In this paper we report results for recognizing colorectal NBI endoscopi...