Learning from Incremental Directional Corrections

11/30/2020
by   Wanxin Jin, et al.
0

This paper proposes a technique which enables a robot to learn a control objective function incrementally from human user's corrections. The human's corrections can be as simple as directional corrections – corrections that indicate the direction of a control change without indicating its magnitude – applied at some time instances during the robot's motion. We only assume that each of the human's corrections, regardless of its magnitude, points in a direction that improves the robot's current motion relative to an implicit objective function. The proposed method uses the direction of a correction to update the estimate of the objective function based on a cutting plane technique. We establish the theoretical results to show that this process of incremental correction and update guarantees convergence of the learned objective function to the implicit one. The method is validated by both simulations and two human-robot games, where human players teach a 2-link robot arm and a 6-DoF quadrotor system for motion planning in environments with obstacles.

READ FULL TEXT
research
08/05/2020

Learning from Sparse Demonstrations

This paper proposes an approach which enables a robot to learn an object...
research
11/06/2021

Flying Trapeze Act Motion Planning Algorithm for Two-Link Free-Flying Acrobatic Robot

A flying trapeze act can be a challenging task for a robotics system sin...
research
03/26/2020

Integrating Combined Task and Motion Planning with Compliant Control

Planning a motion for inserting pegs remains an open problem. The diffic...
research
03/29/2017

On Convergence Property of Implicit Self-paced Objective

Self-paced learning (SPL) is a new methodology that simulates the learni...
research
03/02/2020

The Use of Implicit Human Motor Behaviour in the Online Personalisation of Prosthetic Interfaces

In previous work, the authors proposed a data-driven optimisation algori...
research
08/24/2021

Learning to Arbitrate Human and Robot Control using Disagreement between Sub-Policies

In the context of teleoperation, arbitration refers to deciding how to b...
research
10/28/2020

Inverse Optimal Control from Demonstration Segments

This paper develops an inverse optimal control method to learn an object...

Please sign up or login with your details

Forgot password? Click here to reset