Active Altruism Learning and Information Sufficiency for Autonomous Driving

10/09/2021
by   Jack Geary, et al.
0

Safe interaction between vehicles requires the ability to choose actions that reveal the preferences of the other vehicles. Since exploratory actions often do not directly contribute to their objective, an interactive vehicle must also able to identify when it is appropriate to perform them. In this work we demonstrate how Active Learning methods can be used to incentivise an autonomous vehicle (AV) to choose actions that reveal information about the altruistic inclinations of another vehicle. We identify a property, Information Sufficiency, that a reward function should have in order to keep exploration from unnecessarily interfering with the pursuit of an objective. We empirically demonstrate that reward functions that do not have Information Sufficiency are prone to inadequate exploration, which can result in sub-optimal behaviour. We propose a reward definition that has Information Sufficiency, and show that it facilitates an AV choosing exploratory actions to estimate altruistic tendency, whilst also compensating for the possibility of conflicting beliefs between vehicles.

READ FULL TEXT
research
10/07/2020

Modeling Human Driving Behavior in Highway Scenario using Inverse Reinforcement Learning

Human driving behavior modeling is of great importance for designing saf...
research
01/29/2019

Safe, Efficient, and Comfortable Velocity Control based on Reinforcement Learning for Autonomous Driving

A model used for velocity control during car following was proposed base...
research
09/11/2019

Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

A common approach for defining a reward function for Multi-objective Rei...
research
06/26/2019

NeuroTrajectory: A Neuroevolutionary Approach to Local State Trajectory Learning for Autonomous Vehicles

Autonomous vehicles are controlled today either based on sequences of de...
research
04/05/2022

Inferring Rewards from Language in Context

In classic instruction following, language like "I'd like the JetBlue fl...
research
09/30/2021

Emergency Vehicles Audio Detection and Localization in Autonomous Driving

Emergency vehicles in service have right-of-way over all other vehicles....
research
11/17/2021

GFlowNet Foundations

Generative Flow Networks (GFlowNets) have been introduced as a method to...

Please sign up or login with your details

Forgot password? Click here to reset