ICML-Taylor Expansion of Discount Factors

收藏 2025-07-27

Taylor Expansions of Discount Factors

            Yunhao Tang 1 Mark Rowland 2 Remi Munos 3 Michal Valko 3

         Abstract             example, T could be the first time the MDP gets into a termi-
In practical reinforcement learning (RL), the dis-    nal state (e.g., a robot falls); when the MDP does not have a
count factor used for estimating value functions    natural terminal state, T could be enforced as a deterministic
often differs from that used for defining the ...

附件列表

ICML-Taylor Expansion of Discount Factors.pdf

大小:1.41 MB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群