ICML-Towards Tight Bounds on the Sample Complexity of Average-reward MDPs

收藏 2025-07-27

Towards Tight Bounds on the Sample Complexity
               of Average-reward MDPs

                        Yujia Jin 1 Aaron Sidford 1

            Abstract             making under uncertainty and reinforcement learning (Puter-
                              man, 2014; Sutton & Barto, 2018). It is a prominent theoret-
We prove new upper and lower bounds for sample    ical test-bed for learning algorithms and has been studied ex-
complexity of finding an -optimal poli ...

附件列表

ICML-Towards Tight Bounds on the Sample Complexity of Average-reward MDPs.pdf

大小:1.88 MB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群