ICML-Off-Policy Confidence Sequences

2025-07-28

Off-Policy Confidence Sequences

Nikos Karampatziakis, Paul Mineiro, Aaditya Ramdas

Abstract

We develop confidence bounds that hold uniformly over time for off-policy evaluation in the ...

... that the probability that they ever exclude the true value is bounded by a prespecified quantity. In other words, they retain validity under optional (early) stopping and optional continuation (collecting more data ...
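The time-uniform guarantee described in the excerpt can be illustrated with a small simulation. The sketch below is not the construction from the paper: it assumes plain (on-policy) observations bounded in [0, 1] and builds a deliberately conservative confidence sequence from Hoeffding's inequality plus a union bound over time. The function names `hoeffding_cs_radius`, `ever_excludes`, and `miscoverage_rate` are hypothetical and chosen only for this illustration.

```python
import math
import random


def hoeffding_cs_radius(t, alpha):
    """Half-width at time t of a union-bound Hoeffding confidence sequence.

    Assumption: observations lie in [0, 1]. Step t spends
    alpha_t = alpha / (t * (t + 1)); these sum to alpha over all t >= 1,
    so every interval holds simultaneously with probability >= 1 - alpha.
    """
    alpha_t = alpha / (t * (t + 1))
    return math.sqrt(math.log(2.0 / alpha_t) / (2.0 * t))


def ever_excludes(true_mean, horizon, alpha, rng):
    """Return True if any interval up to `horizon` excludes `true_mean`."""
    total = 0.0
    for t in range(1, horizon + 1):
        # Bernoulli(true_mean) draw, which is bounded in [0, 1].
        total += 1.0 if rng.random() < true_mean else 0.0
        if abs(total / t - true_mean) > hoeffding_cs_radius(t, alpha):
            return True
    return False


def miscoverage_rate(true_mean=0.3, horizon=500, alpha=0.05, runs=200, seed=0):
    """Fraction of simulated runs in which the sequence ever misses the mean."""
    rng = random.Random(seed)
    misses = sum(ever_excludes(true_mean, horizon, alpha, rng) for _ in range(runs))
    return misses / runs


if __name__ == "__main__":
    print("empirical miscoverage:", miscoverage_rate())
```

Because the error budget is spread over all time steps, an analyst can inspect the interval after every observation and stop or continue at will without inflating the error rate. The price of this simple union-bound version is intervals that are wider than necessary.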

Attachments

ICML-Off-Policy Confidence Sequences.pdf

Size: 680.4 KB
