Attention-SLAM: A Visual Monocular SLAM Learning from Human Gaze

by   Jinquan Li, et al.

This paper proposes a novel simultaneous localization and mapping (SLAM) approach, namely Attention-SLAM, which simulates human navigation mode by combining a visual saliency model (SalNavNet) with traditional monocular visual SLAM. Most SLAM methods treat all the features extracted from the images as equal importance during the optimization process. However, the salient feature points in scenes have more significant influence during the human navigation process. Therefore, we first propose a visual saliency model called SalVavNet in which we introduce a correlation module and propose an adaptive Exponential Moving Average (EMA) module. These modules mitigate the center bias to enable the saliency maps generated by SalNavNet to pay more attention to the same salient object. Moreover, the saliency maps simulate the human behavior for the refinement of SLAM results. The feature points extracted from the salient regions have greater importance in optimization process. We add semantic saliency information to the Euroc dataset to generate an open-source saliency SLAM dataset. Comprehensive test results prove that Attention-SLAM outperforms benchmarks such as Direct Sparse Odometry (DSO), ORB-SLAM, and Salient DSO in terms of efficiency, accuracy, and robustness in most test cases.


page 1

page 2

page 3

page 4

page 6

page 7

page 8

page 12


Salient Bundle Adjustment for Visual SLAM

Recently, the philosophy of visual saliency and attention has started to...

Loosely-Coupled Semi-Direct Monocular SLAM

We propose a novel semi-direct approach for monocular simultaneous local...

SalientDSO: Bringing Attention to Direct Sparse Odometry

Although cluttered indoor scenes have a lot of useful high-level semanti...

Attend Before you Act: Leveraging human visual attention for continual learning

When humans perform a task, such as playing a game, they selectively pay...

HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera

This paper proposes a novel visual simultaneous localization and mapping...

Challenges in Monocular Visual Odometry: Photometric Calibration, Motion Bias and Rolling Shutter Effect

Monocular visual odometry (VO) has seen tremendous improvements in accur...

RGB-D SLAM Using Attention Guided Frame Association

Deep learning models as an emerging topic have shown great progress in v...

Please sign up or login with your details

Forgot password? Click here to reset