Interleaving Fast and Slow Decision Making

10/30/2020
by   Aditya Gulati, et al.
18

The "Thinking, Fast and Slow" paradigm of Kahneman proposes that we use two different styles of thinking – a fast and intuitive System 1 for certain tasks, along with a slower but more analytical System 2 for others. While the idea of using this two-system style of thinking is gaining popularity in AI and robotics, our work considers how to interleave the two styles of decision-making, i.e., how System 1 and System 2 should be used together. For this, we propose a novel and general framework which includes a new System 0 to oversee Systems 1 and 2. At every point when a decision needs to be made, System 0 evaluates the situation and quickly hands over the decision-making process to either System 1 or System 2. We evaluate such a framework on a modified version of the classic Pac-Man game, with an already-trained RL algorithm for System 1, a Monte-Carlo tree search for System 2, and several different possible strategies for System 0. As expected, arbitrary switches between Systems 1 and 2 do not work, but certain strategies do well. With System 0, an agent is able to perform better than one that uses only System 1 or System 2.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2019

Combining Planning and Deep Reinforcement Learning in Tactical Decision Making for Autonomous Driving

Tactical decision making for autonomous driving is challenging due to th...
research
09/24/2021

MCTS Based Agents for Multistage Single-Player Card Game

The article presents the use of Monte Carlo Tree Search algorithms for t...
research
11/09/2019

Markov-chain Monte-Carlo Sampling for Optimal Fidelity Determination in Dynamic Decision-Making

Decision making for dynamic systems is challenging due to the scale and ...
research
04/17/2021

Generating Diverse and Competitive Play-Styles for Strategy Games

Designing agents that are able to achieve different play-styles while ma...
research
07/04/2010

A Fast Decision Technique for Hierarchical Hough Transform for Line Detection

Many techniques have been proposed to speedup the performance of classic...
research
02/18/2021

L2E: Learning to Exploit Your Opponent

Opponent modeling is essential to exploit sub-optimal opponents in strat...
research
08/15/2021

A Fast Algorithm for Computing the Deficiency Number of a Mahjong Hand

The tile-based multiplayer game Mahjong is widely played in Asia and has...

Please sign up or login with your details

Forgot password? Click here to reset