Nash Convergence of Mean-Based Learning Algorithms in First Price Auctions

10/08/2021
by   Xiaotie Deng, et al.
0

Understanding the convergence properties of learning dynamics in repeated auctions is a timely and important question in the area of learning in auctions, with numerous applications in, e.g., online advertising markets. This work focuses on repeated first price auctions where bidders with fixed values for the item learn to bid using mean-based algorithms – a large class of online learning algorithms that include popular no-regret algorithms such as Multiplicative Weights Update and Follow the Perturbed Leader. We completely characterize the learning dynamics of mean-based algorithms, in terms of convergence to a Nash equilibrium of the auction, in two senses: (1) time-average: the fraction of rounds where bidders play a Nash equilibrium approaches 1 in the limit; (2)last-iterate: the mixed strategy profile of bidders approaches a Nash equilibrium in the limit. Specifically, the results depend on the number of bidders with the highest value: - If the number is at least three, the bidding dynamics almost surely converges to a Nash equilibrium of the auction, both in time-average and in last-iterate. - If the number is two, the bidding dynamics almost surely converges to a Nash equilibrium in time-average but not necessarily in last-iterate. - If the number is one, the bidding dynamics may not converge to a Nash equilibrium in time-average nor in last-iterate. Our discovery opens up new possibilities in the study of convergence dynamics of learning algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2020

Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions

The connection between games and no-regret algorithms has been widely st...
research
12/21/2017

Infinitely Split Nash Equilibrium Problems in Repeated Games

In this paper, we introduce the concept of infinitely split Nash equilib...
research
09/19/2023

Game Connectivity and Adaptive Dynamics

We analyse the typical structure of games in terms of the connectivity p...
research
12/20/2019

No-Regret Learning from Partially Observed Data in Repeated Auctions

We study a general class of repeated auctions, such as the ones found in...
research
03/05/2021

Learning in Matrix Games can be Arbitrarily Complex

A growing number of machine learning architectures, such as Generative A...
research
12/03/2020

On the Impossibility of Convergence of Mixed Strategies with No Regret Learning

We study convergence properties of the mixed strategies that result from...
research
08/08/2022

Peer Prediction for Learning Agents

Peer prediction refers to a collection of mechanisms for eliciting infor...

Please sign up or login with your details

Forgot password? Click here to reset