The detour problem in a stochastic environment: Tolman revisited

09/27/2017
by   Pegah Fakhari, et al.
0

We designed a grid world task to study human planning and re-planning behavior in an unknown stochastic environment. In our grid world, participants were asked to travel from a random starting point to a random goal position while maximizing their reward. Because they were not familiar with the environment, they needed to learn its characteristics from experience to plan optimally. Later in the task, we randomly blocked the optimal path to investigate whether and how people adjust their original plans to find a detour. To this end, we developed and compared 12 different models. These models were different on how they learned and represented the environment and how they planned to catch the goal. The majority of our participants were able to plan optimally. We also showed that people were capable of revising their plans when an unexpected event occurred. The result from the model comparison showed that the model-based reinforcement learning approach provided the best account for the data and outperformed heuristics in explaining the behavioral data in the re-planning trials.

READ FULL TEXT

page 9

page 12

page 14

page 15

page 17

research
05/16/2018

Modeling Human Inference of Others' Intentions in Complex Situations with Plan Predictability Bias

A recent approach based on Bayesian inverse planning for the "theory of ...
research
02/13/2020

The Efficiency of Human Cognition Reflects Planned Information Processing

Planning is useful. It lets people take actions that have desirable long...
research
07/27/2023

Thinker: Learning to Plan and Act

We propose the Thinker algorithm, a novel approach that enables reinforc...
research
05/24/2017

Efficient, Safe, and Probably Approximately Complete Learning of Action Models

In this paper we explore the theoretical boundaries of planning in a set...
research
06/05/2013

Inferring Robot Task Plans from Human Team Meetings: A Generative Modeling Approach with Logic-Based Prior

We aim to reduce the burden of programming and deploying autonomous syst...
research
07/27/2020

Resource-rational Task Decomposition to Minimize Planning Costs

People often plan hierarchically. That is, rather than planning over a m...
research
05/14/2021

Control of mental representations in human planning

One of the most striking features of human cognition is the capacity to ...

Please sign up or login with your details

Forgot password? Click here to reset