We address the challenge of learning factored policies in cooperative MA...
Traditionally, off-policy learning algorithms (such as Q-learning) and e...
This work develops a fully decentralized multi-agent algorithm for polic...
In this paper we develop a fully decentralized algorithm for policy eval...