ICML-On Reward-Free RL with Kernel and Neural Function Approximations Singl ...

收藏 2025-07-27

On Reward-Free RL with Kernel and Neural Function Approximations:
         Single-Agent MDP and Markov Game

            Shuang Qiu 1 Jieping Ye 1 Zhaoran Wang 2 Zhuoran Yang 3

            Abstract             is large and function approximators such as neural networks
To achieve sample efficiency in reinforcement       are employed. To achieve sample efficiency, any RL algo-
learning (RL), it necessitates to efficiently explore rithm needs to accurately learn the transit ...

附件列表

ICML-On Reward-Free RL with Kernel and Neural Function Approximations Single-Ag.pdf

大小:360.65 KB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群