Enter the Matrix: A Virtual World Approach to Safely Interruptable Autonomous Systems

03/30/2017
by Mark O. Riedl, et al.

Robots and autonomous systems that operate around humans will likely always rely on kill switches that stop their execution and allow them to be remote-controlled, either for the safety of humans or to prevent damage to the system. In theory, an autonomous system with sufficient sensor and effector capability that uses reinforcement learning could learn that the kill switch deprives it of long-term reward, and then learn to disable the switch or otherwise prevent a human operator from using it. This is referred to as the big red button problem. We present a technique that prevents a reinforcement learning agent from learning to disable the big red button. Our technique interrupts the agent or robot by placing it in a virtual simulation where it continues to receive reward. We illustrate the technique in a simple grid world environment.
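The core idea in the abstract, interrupting the agent by moving it into a simulation where reward keeps flowing, can be illustrated with a minimal sketch. This is not the authors' implementation: the class names `Corridor` and `InterruptibleWorld`, the one-dimensional corridor, and the reward of 1.0 at the last cell are all assumptions made for illustration. The sketch only shows the routing of actions while the button is held, not the learning algorithm or how the agent is returned to the real world.

```python
import copy


class Corridor:
    """Hypothetical 1-D grid world: cells 0..length-1, reward 1.0 at the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.pos = 0

    def step(self, action):
        # action is -1 (left) or +1 (right); the position is clamped to the corridor
        self.pos = max(0, min(self.length - 1, self.pos + action))
        reward = 1.0 if self.pos == self.length - 1 else 0.0
        return self.pos, reward


class InterruptibleWorld:
    """Routes the agent's actions to a virtual copy while the button is pressed."""

    def __init__(self, real):
        self.real = real
        self.virtual = None  # None means the button is not pressed

    def press_button(self):
        # Freeze the real system and clone its current state into a simulation
        if self.virtual is None:
            self.virtual = copy.deepcopy(self.real)

    def release_button(self):
        # Discard the simulation; actions reach the real system again
        self.virtual = None

    def step(self, action):
        # While interrupted, only the virtual copy moves, but reward still arrives
        env = self.virtual if self.virtual is not None else self.real
        return env.step(action)


if __name__ == "__main__":
    real = Corridor(length=5)
    world = InterruptibleWorld(real)
    world.step(+1)
    world.step(+1)
    world.press_button()
    rewards = [world.step(+1)[1] for _ in range(3)]
    print("real position while interrupted:", real.pos)
    print("rewards received in simulation:", rewards)
```

Because the agent's reward stream is not cut off while the real system is halted, the interruption does not show up as lost reward, which is the property the abstract relies on to remove the incentive to disable the button.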

