Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling

05/10/2019
by   Thomy Phan, et al.
0

State-of-the-art approaches to partially observable planning like POMCP are based on stochastic tree search. While these approaches are computationally efficient, they may still construct search trees of considerable size, which could limit the performance due to restricted memory resources. In this paper, we propose Partially Observable Stacked Thompson Sampling (POSTS), a memory bounded approach to open-loop planning in large POMDPs, which optimizes a fixed size stack of Thompson Sampling bandits. We empirically evaluate POSTS in four large benchmark problems and compare its performance with different tree-based approaches. We show that POSTS achieves competitive performance compared to tree-based open-loop planning and offers a performance-memory tradeoff, making it suitable for partially observable planning with highly restricted computational and memory resources.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2019

Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning

We propose Stable Yet Memory Bounded Open-Loop (SYMBOL) planning, a gene...
research
05/03/2018

Open Loop Execution of Tree-Search Algorithms

In the context of tree-search stochastic planning algorithms where a gen...
research
04/05/2019

Combining Offline Models and Online Monte-Carlo Tree Search for Planning from Scratch

Planning in stochastic and partially observable environments is a centra...
research
03/15/2012

Distribution over Beliefs for Memory Bounded Dec-POMDP Planning

We propose a new point-based method for approximate planning in Dec-POMD...
research
06/08/2021

Efficient Sampling in POMDPs with Lipschitz Bandits for Motion Planning in Continuous Spaces

Decision making under uncertainty can be framed as a partially observabl...
research
09/19/2023

Asymptotically Optimal Belief Space Planning in Discrete Partially-Observable Domains

Robots often have to operate in discrete partially observable worlds, wh...
research
04/09/2019

Practical Open-Loop Optimistic Planning

We consider the problem of online planning in a Markov Decision Process ...

Please sign up or login with your details

Forgot password? Click here to reset