Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction

04/03/2023
by Abhishek Paudel, et al.

We present a novel approach for fast and reliable policy selection for navigation in partial maps. Using the recent learning-augmented, model-based Learning over Subgoals Planning (LSP) abstraction to plan, our robot reuses data collected during navigation to evaluate how well alternative policies could have performed, via a procedure we call offline alt-policy replay. Costs from offline alt-policy replay constrain policy selection among the LSP-based policies during deployment, allowing for improvements in convergence speed, cumulative regret, and average navigation cost. With only limited prior knowledge about the nature of unseen environments, we achieve at least a 67% improvement in cumulative regret over the baseline bandit approach in our experiments in simulated maze and office-like environments.
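
To make the idea of replay costs constraining bandit-style policy selection concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a confidence-bound bandit over navigation cost, and it assumes that the mean replayed cost of a policy acts as a floor on that policy's optimistic cost estimate. The function name `select_policy`, its parameters, and the exploration constant `c` are all hypothetical.

```python
import math


def select_policy(deployed_costs, replayed_costs, trial, c=1.0):
    """Return the index of the policy with the lowest optimistic cost.

    deployed_costs[i]: costs observed when policy i was actually deployed.
    replayed_costs[i]: costs estimated for policy i by offline replay
                       (assumption: treated as a floor on the estimate).
    trial: 1-based index of the current trial.
    c: exploration constant (hypothetical tuning parameter).
    """
    scores = []
    for deployed, replayed in zip(deployed_costs, replayed_costs):
        if deployed:
            mean = sum(deployed) / len(deployed)
            bonus = c * math.sqrt(math.log(trial) / len(deployed))
            optimistic = mean - bonus  # lower confidence bound on cost
        else:
            optimistic = -math.inf  # never deployed: maximally optimistic
        if replayed:
            # Replay evidence stops the estimate from being overly optimistic.
            floor = sum(replayed) / len(replayed)
            optimistic = max(optimistic, floor)
        scores.append(optimistic)
    # Lowest optimistic cost wins; ties go to the lowest index.
    return min(range(len(scores)), key=scores.__getitem__)


# Policy 0 was deployed once; policies 1 and 2 were only replayed offline.
choice = select_policy([[100.0], [], []], [[], [150.0], [90.0]], trial=2)
```

In this sketch, a policy that has never been deployed but whose replayed cost is high (policy 1 above) is no longer selected purely for exploration. Skipping such deployments is the intuition behind faster convergence.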


