Learning in Cournot Games with Limited Information Feedback

06/15/2019
by   Yuanyuan Shi, et al.
0

In this work, we study the interaction of strategic players in continuous action Cournot games with limited information feedback. Cournot game is the essential market model for many socio-economic systems where players learn and compete without the full knowledge of the system or each other. In this setting, it becomes important to understand the dynamics and limiting behavior of the players. We consider concave Cournot games and two widely used learning strategies in the form of no-regret algorithms and policy gradient. We prove that if the players all adopt one of the two algorithms, their joint action converges to the unique Nash equilibrium of the game. Notably, we show that myopic algorithms such as policy gradient can achieve exponential convergence rate, while no-regret algorithms only converge (sub)linearly. Together, our work presents significantly sharper convergence results and shows how exploiting the structure of the game can lead to much faster convergence rates.

READ FULL TEXT
research
09/14/2020

Multi-Agent Reinforcement Learning in Cournot Games

In this work, we study the interaction of strategic agents in continuous...
research
10/18/2021

Game Redesign in No-regret Game Playing

We study the game redesign problem in which an external designer has the...
research
07/08/2022

Online Learning in Supply-Chain Games

We study a repeated game between a supplier and a retailer who want to m...
research
10/03/2018

Bandit learning in concave N-person games

This paper examines the long-run behavior of learning with bandit feedba...
research
06/19/2023

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

The congestion game is a powerful model that encompasses a range of engi...
research
06/13/2022

No-Regret Learning in Games with Noisy Feedback: Faster Rates and Adaptivity via Learning Rate Separation

We examine the problem of regret minimization when the learner is involv...
research
06/27/2023

Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees

In this work, we introduce a new variant of online gradient descent, whi...

Please sign up or login with your details

Forgot password? Click here to reset