Learning Trajectory Prediction with Continuous Inverse Optimal Control via Langevin Sampling of Energy-Based Models

04/10/2019
by   Yifei Xu, et al.
0

Autonomous driving is a challenging multiagent domain which requires optimizing complex, mixed cooperative-competitive interactions. Learning to predict contingent distributions over other vehicles' trajectories simplifies the problem, allowing approximate solutions by trajectory optimization with dynamic constraints. We take a model-based approach to prediction, in order to make use of structured prior knowledge of vehicle kinematics, and the assumption that other drivers plan trajectories to minimize an unknown cost function. We introduce a novel inverse optimal control (IOC) algorithm to learn other vehicles' cost functions in an energy-based generative model. Langevin Sampling, a Monte Carlo based sampling algorithm, is used to directly sample the control sequence. Our algorithm provides greater flexibility than standard IOC methods, and can learn higher-level, non-Markovian cost functions defined over entire trajectories. We extend weighted feature-based cost functions with neural networks to obtain NN-augmented cost functions, which combine the advantages of both model-based and model-free learning. Results show that model-based IOC can achieve state-of-the-art vehicle trajectory prediction accuracy, and naturally take scene information into account.

READ FULL TEXT
research
03/01/2016

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization

Reinforcement learning can acquire complex behaviors from high-level spe...
research
06/22/2020

Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning with Application to Autonomous Driving

In the past decades, we have witnessed significant progress in the domai...
research
05/22/2018

Learning to Optimize via Wasserstein Deep Inverse Optimal Control

We study the inverse optimal control problem in social sciences: we aim ...
research
04/25/2021

A Robustness Analysis of Inverse Optimal Control of Bipedal Walking

Cost functions have the potential to provide compact and understandable ...
research
03/21/2018

Inverse Optimal Control with Incomplete Observations

In this article, we consider the inverse optimal control problem given i...
research
05/17/2023

Model-based Validation as Probabilistic Inference

Estimating the distribution over failures is a key step in validating au...
research
03/20/2021

Learning Continuous Cost-to-Go Functions for Non-holonomic Systems

This paper presents a supervised learning method to generate continuous ...

Please sign up or login with your details

Forgot password? Click here to reset