ICML-Preferential Temporal Difference Learning

收藏 2025-07-27

Preferential Temporal Difference Learning

                  Nishanth Anand 1 2 Doina Precup 1 2 3

         Abstract                TD-learning can be viewed as a way to approximate dy-
Temporal-Difference (TD) learning is a general    namic programming algorithms in Markovian environ-
and very useful tool for estimating the value func-    ments (Barnard, 1993). But, if the Markovian assumption
tion of a given policy, which in turn is required    does not hold (as is  ...

附件列表

ICML-Preferential Temporal Difference Learning.pdf

大小:6.93 MB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群