ICML-Exponential Lower Bounds for Batch Reinforcement Learning Batch RL can ...

2025-08-10

Exponential Lower Bounds for Batch Reinforcement Learning:
Batch RL can be Exponentially Harder than Online RL

Andrea Zanette

Abstract

Several practical applications of reinforcement learning involve an agent learning from past data without th ...

... we consider two classical batch RL problems: 1) the off-policy evaluation (OPE) problem, where the batch algorithm needs to predict the performance of a target policy

