Computing equilibria by minimizing exploitability with best-response ensembles

01/20/2023
by   Carlos Martin, et al.
0

In this paper, we study the problem of computing an approximate Nash equilibrium of a continuous game. Such games naturally model many situations involving space, time, money, and other fine-grained resources or quantities. The standard measure of the closeness of a strategy profile to Nash equilibrium is exploitability, which measures how much utility players can gain from changing their strategy unilaterally. We introduce a new equilibrium-finding method that minimizes an approximation of the exploitability. This approximation employs a best-response ensemble for each player that maintains multiple candidate best responses for that player. In each iteration, the best-performing element of each ensemble is used in a gradient-based scheme to update the current strategy profile. The strategy profile and best-response ensembles are simultaneously trained to minimize and maximize the approximate exploitability, respectively. Experiments on a suite of benchmark games show that it outperforms previous methods.

READ FULL TEXT
research
10/26/2019

Finding Mixed Strategy Nash Equilibrium for Continuous Games through Deep Learning

Nash equilibrium has long been a desired solution concept in multi-playe...
research
08/17/2021

Learning to Compute Approximate Nash Equilibrium for Normal-form Games

In this paper, we propose a general meta learning approach to computing ...
research
03/11/2021

XDO: A Double Oracle Algorithm for Extensive-Form Games

Policy Space Response Oracles (PSRO) is a deep reinforcement learning al...
research
01/05/2023

Algorithms and Complexity for Computing Nash Equilibria in Adversarial Team Games

Adversarial team games model multiplayer strategic interactions in which...
research
10/31/2020

When "Better" is better than "Best"

We consider two-player normal form games where each player has the same ...
research
02/09/2023

Regularization for Strategy Exploration in Empirical Game-Theoretic Analysis

In iterative approaches to empirical game-theoretic analysis (EGTA), the...
research
06/30/2021

Bounded rationality for relaxing best response and mutual consistency: An information-theoretic model of partial self-reference

While game theory has been transformative for decision-making, the assum...

Please sign up or login with your details

Forgot password? Click here to reset