The Mean-Squared Error of Double Q-Learning
Wentao Weng Harsh Gupta
Tsinghua University University of Illinois at Urbana-Champaign
wwt17@mails.tsinghua.edu.cn hgupta10@illinois.edu
Niao He Lei Ying
University of Illinois at Urbana-Champaign University of Michigan, Ann Arbor
niaohe@illinois.edu leiying@umich.edu
R. Srikant
University of Illi ...
附件列表