Deep Reinforcement Learning with Stage Incentive Mechanism for Robotic Trajectory Planning

by   Jin Yang, et al.

To improve the efficiency of deep reinforcement learning (DRL) based methods for robot manipulator trajectory planning in random working environment. Different from the traditional sparse reward function, we present three dense reward functions in this paper. Firstly, posture reward function is proposed to accelerate the learning process with a more reasonable trajectory by modeling the distance and direction constraints, which can reduce the blindness of exploration. Secondly, to improve the stability, a reward function at stride reward is proposed by modeling the distance and movement distance of joints constraints, it can make the learning process more stable. In order to further improve learning efficiency, we are inspired by the cognitive process of human behavior and propose a stage incentive mechanism, including hard stage incentive reward function and soft stage incentive reward function. Extensive experiments show that the soft stage incentive reward function proposed is able to improve convergence rate by up to 46.9 methods. The percentage increase in convergence mean reward is 4.4 the percentage decreases with respect to standard deviation by 21.9 the evaluation, the success rate of trajectory planning for robot manipulator is up to 99.6


page 7

page 11

page 13

page 15

page 17


Pitfalls of learning a reward function online

In some agent designs like inverse reinforcement learning an agent needs...

Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics

Although Deep Reinforcement Learning (DRL) has achieved notable success ...

Internally Rewarded Reinforcement Learning

We study a class of reinforcement learning problems where the reward sig...

Efficient Multi-robot Exploration via Multi-head Attention-based Cooperation Strategy

The goal of coordinated multi-robot exploration tasks is to employ a tea...

Incorrigibility in the CIRL Framework

A value learning system has incentives to follow shutdown instructions, ...

Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks

This paper describes a deep reinforcement learning (DRL) approach that w...

Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning

Recent advances in deep reinforcement learning algorithms have shown gre...

Please sign up or login with your details

Forgot password? Click here to reset