Policy Optimization Provably Converges to Nash
Equilibria in Zero-Sum Linear Quadratic Games
Kaiqing Zhang Zhuoran Yang
ECE and CSL ORFE
University of Illinois at Urbana-Champaign Princeton University
kzhang66@illinois.edu zy6@princeton.edu
Tamer Basar
ECE and CSL
University of Illinois at Urbana-Champaign
basar1@illin ...
附件列表