MERMAIDE: Learning to Align Learners using Model-Based Meta-Learning

04/10/2023
by Arundhati Banerjee, et al.

We study how a principal can efficiently and effectively intervene on the rewards of a previously unseen learning agent in order to induce desirable outcomes. This is relevant to many real-world settings, such as auctions or taxation, where the principal may know neither the learning behavior nor the rewards of real people. Moreover, because interventions are often costly, the principal should be few-shot adaptable and should minimize the number of interventions. We introduce MERMAIDE, a model-based meta-learning framework for training a principal that can quickly adapt to out-of-distribution agents with different learning strategies and reward functions. We validate this approach step by step. First, in a Stackelberg setting with a best-response agent, we show that meta-learning enables quick convergence to the theoretically known Stackelberg equilibrium at test time, although noisy observations severely increase the sample complexity. We then show that our model-based meta-learning approach is cost-effective in intervening on bandit agents with unseen explore-exploit strategies. Finally, we outperform baselines that use either meta-learning or agent behavior modeling, in both 0-shot and K=1-shot settings with partial agent information.
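To make the bandit setting above concrete, the following is a minimal toy sketch of a principal intervening on the rewards of a learning agent. It is not the MERMAIDE method: it involves no meta-learning and no learned agent model. The epsilon-greedy agent, the fixed-bonus intervention rule, and every name used (`EpsilonGreedyAgent`, `run_episode`, `base_rewards`, `target_arm`, `bonus`) are assumptions introduced purely for illustration.

```python
import random


class EpsilonGreedyAgent:
    """Hypothetical bandit agent; one of many possible explore-exploit strategies."""

    def __init__(self, n_arms, epsilon, rng):
        self.n_arms = n_arms
        self.epsilon = epsilon
        self.rng = rng
        self.counts = [0] * n_arms
        self.values = [0.0] * n_arms  # running mean of observed reward per arm

    def act(self):
        # Explore uniformly with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_arms)
        return max(range(self.n_arms), key=lambda a: self.values[a])

    def update(self, arm, reward):
        # Incremental update of the running mean for the pulled arm.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


def run_episode(base_rewards, target_arm, bonus, steps=2000, epsilon=0.1, seed=0):
    """Simulate a principal that adds `bonus` whenever the agent pulls `target_arm`.

    Assumption: the principal uses a fixed hand-written rule and pays the bonus
    as its intervention cost. Returns (pulls of the target arm, total cost).
    """
    rng = random.Random(seed)
    agent = EpsilonGreedyAgent(len(base_rewards), epsilon, rng)
    target_pulls = 0
    total_cost = 0.0
    for _ in range(steps):
        arm = agent.act()
        reward = base_rewards[arm]  # the agent's own (deterministic) reward
        if arm == target_arm:
            reward += bonus         # the principal's intervention
            total_cost += bonus
            target_pulls += 1
        agent.update(arm, reward)
    return target_pulls, total_cost


if __name__ == "__main__":
    rewards = [1.0, 0.5, 0.2]  # the agent intrinsically prefers arm 0
    print("no intervention:", run_episode(rewards, target_arm=2, bonus=0.0))
    print("with bonus 1.0: ", run_episode(rewards, target_arm=2, bonus=1.0))
```

In this toy setup the bonus raises the target arm's observed reward above that of the agent's preferred arm, so the agent shifts toward the target once exploration discovers it, at a cost proportional to the number of target pulls. The trade-off between inducing the desired behavior and the cost of intervening is the quantity the abstract refers to as cost-effectiveness.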

research
03/30/2018

Learning to Adapt: Meta-Learning for Model-Based Control

Although reinforcement learning methods can achieve impressive results i...
research
10/02/2020

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Meta-learning is a powerful tool for learning policies that can adapt ef...
research
10/07/2022

Robotic Control Using Model Based Meta Adaption

In machine learning, meta-learning methods aim for fast adaptability to ...
research
01/27/2021

Multilingual and cross-lingual document classification: A meta-learning approach

The great majority of languages in the world are considered under-resour...
research
12/20/2019

Meta-Graph: Few shot Link Prediction via Meta Learning

Fast adaptation to new data is one key facet of human intelligence and i...
research
01/01/2022

Distributed Evolution Strategies Using TPUs for Meta-Learning

Meta-learning traditionally relies on backpropagation through entire tas...
research
03/17/2021

HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks

We propose HyperDynamics, a dynamics meta-learning framework that condit...
