ICML-Emphatic Algorithms for Deep Reinforcement Learning

收藏 2025-08-10

Emphatic Algorithms for Deep Reinforcement Learning

            Ray Jiang 1 Tom Zahavy 1 Zhongwen Xu 1 Adam White 1 2
            Matteo Hessel 1 Charles Blundell 1 Hado van Hasselt 1

            Abstract             Many reinforcement learning (RL) agents learn off-policy
                              to some extent, to learn about the greedy policy while ex-
Off-policy learning allows us to learn about pos-    ploring (Watkins, 1989), to make predictions about policies
...

附件列表

ICML-Emphatic Algorithms for Deep Reinforcement Learning.pdf

大小:2.78 MB

只需: RMB 6 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群