Efficient exploration of zero-sum stochastic games

02/24/2020
by   Carlos Martin, et al.
8

We investigate the increasingly important and common game-solving setting where we do not have an explicit description of the game but only oracle access to it through gameplay, such as in financial or military simulations and computer games. During a limited-duration learning phase, the algorithm can control the actions of both players in order to try to learn the game and how to play it well. After that, the algorithm has to produce a strategy that has low exploitability. Our motivation is to quickly learn strategies that have low exploitability in situations where evaluating the payoffs of a queried strategy profile is costly. For the stochastic game setting, we propose using the distribution of state-action value functions induced by a belief distribution over possible environments. We compare the performance of various exploration strategies for this task, including generalizations of Thompson sampling and Bayes-UCB to this new setting. These two consistently outperform other strategies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2011

Computing Strong Game-Theoretic Strategies in Jotto

We develop a new approach that computes approximate equilibrium strategi...
research
06/08/2016

Learning Language Games through Interaction

We introduce a new language learning setting relevant to building adapti...
research
05/21/2021

Evaluating Strategy Exploration in Empirical Game-Theoretic Analysis

In empirical game-theoretic analysis (EGTA), game models are extended it...
research
10/16/2012

Inferring Strategies from Limited Reconnaissance in Real-time Strategy Games

In typical real-time strategy (RTS) games, enemy units are visible only ...
research
12/07/2018

Taking the Scenic Route: Automatic Exploration for Videogames

Machine playtesting tools and game moment search engines require exposur...
research
11/18/2019

Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

We tackle the problem of learning equilibria in simulation-based games. ...
research
07/05/2018

Surprising strategies obtained by stochastic optimization in partially observable games

This paper studies the optimization of strategies in the context of poss...

Please sign up or login with your details

Forgot password? Click here to reset