Weighted QMIX: Expanding Monotonic Value
Function Factorisation for Deep Multi-Agent
Reinforcement Learning
Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson
Department of Computer Science
University of Oxford
{tabish.rashid, gregory.farquhar, bei.peng, shimon.whiteson}@cs.ox.ac.uk
Abstract
QMIX is a popular Q-learning algorithm for cooperative MARL in the centralised
training and decentralise ...
附件列表