Formalizing the Problem of Side Effect Regularization

06/23/2022
by   Alexander Matt Turner, et al.
0

AI objectives are often hard to specify properly. Some approaches tackle this problem by regularizing the AI's side effects: Agents must weigh off "how much of a mess they make" with an imperfectly specified proxy objective. We propose a formal criterion for side effect regularization via the assistance game framework. In these games, the agent solves a partially observable Markov decision process (POMDP) representing its uncertainty about the objective function it should optimize. We consider the setting where the true objective is revealed to the agent at a later time step. We show that this POMDP is solved by trading off the proxy reward with the agent's ability to achieve a range of future tasks. We empirically demonstrate the reasonableness of our problem formalization via ground-truth evaluation in two gridworld environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2018

Maximizing Expected Impact in an Agent Reputation Network -- Technical Report

Many multi-agent systems (MASs) are situated in stochastic environments....
research
02/07/2021

Consequences of Misaligned AI

AI systems often rely on two key components: a specified goal or reward ...
research
03/02/2019

Tolling for Constraint Satisfaction in Markov Decision Process Congestion Games

Markov decision process (MDP) congestion game is an extension of classic...
research
09/29/2021

On Assessing the Usefulness of Proxy Domains for Developing and Evaluating Embodied Agents

In many situations it is either impossible or impractical to develop and...
research
03/11/2019

Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft

Deep Q-Learning has been successfully applied to a wide variety of tasks...
research
06/26/2023

Experiments with Detecting and Mitigating AI Deception

How to detect and mitigate deceptive AI systems is an open problem for t...
research
10/18/2021

Lifting DecPOMDPs for Nanoscale Systems – A Work in Progress

DNA-based nanonetworks have a wide range of promising use cases, especia...

Please sign up or login with your details

Forgot password? Click here to reset