Time-Optimal Path Tracking for Industrial Robots: A Dynamic Model-Free Reinforcement Learning Approach

07/02/2019
by   Jiadong Xiao, et al.
0

In pursuit of the time-optimal path tracking (TOPT) trajectory of a robot manipulator along a preset path, a beforehand identified robot dynamic model is usually used to obtain the required optimal trajectory for perfect tracking. However, due to the inevitable model-plant mismatch, there may be a big error between the actually measured torques and the calculated torques by the dynamic model, which causes the obtained trajectory to be suboptimal or even be infeasible by exceeding given limits. This paper presents a TOPT-oriented SARSA algorithm (TOPTO-SARSA) and a two-step method for finding the time-optimal motion and ensuring the feasibility : Firstly, using TOPTO-SARSA to find a safe trajectory that satisfies the kinematic constraints through the interaction between reinforcement learning agent and kinematic model. Secondly, using TOPTO-SARSA to find the optimal trajectory through the interaction between the agent and the real world, and assure the actually measured torques satisfy the given limits at the last interaction. The effectiveness of the proposed algorithm has been verified through experiments on a 6-DOF robot manipulator.

READ FULL TEXT
research
07/02/2019

Time-optimal path tracking for industrial robot: A model-free reinforcement approach

In pursuit of the time-optimal motion of a robot manipulator along a pre...
research
06/30/2019

Reinforcement Learning for Robotic Time-optimal Path Tracking Using Prior Knowledge

Time-optimal path tracking, as a significant tool for industrial robots,...
research
05/31/2023

Regulated Pure Pursuit for Robot Path Tracking

The accelerated deployment of service robots have spawned a number of al...
research
03/05/2021

Learning Collision-free and Torque-limited Robot Trajectories based on Alternative Safe Behaviors

This paper presents an approach to learn online generation of collision-...
research
09/07/2022

Efficient Trajectory Planning and Control for USV with Vessel Dynamics and Differential Flatness

Unmanned surface vessels (USVs) are widely used in ocean exploration and...
research
03/03/2022

Learning Time-optimized Path Tracking with or without Sensory Feedback

In this paper, we present a learning-based approach that allows a robot ...
research
07/23/2020

Challenging common bolus advisor for self-monitoring type-I diabetes patients using Reinforcement Learning

Patients with diabetes who are self-monitoring have to decide right befo...

Please sign up or login with your details

Forgot password? Click here to reset