Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning

09/07/2023
by   Chengmin Zhou, et al.
0

Motion planning is the soul of robot decision making. Classical planning algorithms like graph search and reaction-based algorithms face challenges in cases of dense and dynamic obstacles. Deep learning algorithms generate suboptimal one-step predictions that cause many collisions. Reinforcement learning algorithms generate optimal or near-optimal time-sequential predictions. However, they suffer from slow convergence, suboptimal converged results, and overfittings. This paper introduces a hybrid algorithm for robotic motion planning: long short-term memory (LSTM) pooling and skip connection for attention-based discrete soft actor critic (LSA-DSAC). First, graph network (relational graph) and attention network (attention weight) interpret the environmental state for the learning of the discrete soft actor critic algorithm. The expressive power of attention network outperforms that of graph in our task by difference analysis of these two representation methods. However, attention based DSAC faces the overfitting problem in training. Second, the skip connection method is integrated to attention based DSAC to mitigate overfitting and improve convergence speed. Third, LSTM pooling is taken to replace the sum operator of attention weigh and eliminate overfitting by slightly sacrificing convergence speed at early-stage training. Experiments show that LSA-DSAC outperforms the state-of-the-art in training and most evaluations. The physical robot is also implemented and tested in the real world.

READ FULL TEXT

page 15

page 16

research
02/05/2021

An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios

Intelligent robots provide a new insight into efficiency improvement in ...
research
02/04/2021

A review of motion planning algorithms for intelligent robotics

We investigate and analyze principles of typical motion planning algorit...
research
11/13/2020

Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning

Constructing agents with planning capabilities has long been one of the ...
research
12/20/2022

Variational Quantum Soft Actor-Critic for Robotic Arm Control

Deep Reinforcement Learning is emerging as a promising approach for the ...
research
03/05/2019

Deep Learning Based Motion Planning For Autonomous Vehicle Using Spatiotemporal LSTM Network

Motion Planning, as a fundamental technology of automatic navigation for...
research
02/26/2020

When Do Drivers Concentrate? Attention-based Driver Behavior Modeling With Deep Reinforcement Learning

Driver distraction a significant risk to driving safety. Apart from spat...
research
05/23/2019

Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment

We focus on the important problem of emergency evacuation, which clearly...

Please sign up or login with your details

Forgot password? Click here to reset