ICML-Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm

收藏 2025-07-28

Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm

            Sajad Khodadadian 1 Zaiwei Chen 2 Siva Theja Maguluri 1

         Abstract                An AC algorithm can be thought as a generalized policy iter-
In this paper, we provide finite-sample conver-    ation (Puterman, 1995), and consists of two phases, namely
gence guarantees for an off-policy variant of the    actor and critic. The objective of the actor is to improve the
natural actor-crit ...

附件列表

ICML-Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm.pdf

大小:508.18 KB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群