ICML-Accelerating Safe Reinforcement Learning with Constraint-mismatched Ba ...

收藏 2025-08-11

Accelerating Safe Reinforcement Learning
            with Constraint-mismatched Baseline Policies

      Tsung-Yen Yang 1 Justinian Rosca 2 Karthik Narasimhan 1 Peter J. Ramadge 1

         Abstract                or other costs. For instance, when you drive an unfamiliar
                              vehicle, you do so cautiously to ensure safety, while adapt-
We consider the problem of reinforcement learn-
                              ing your driving technique to the ve ...

附件列表

ICML-Accelerating Safe Reinforcement Learning with Constraint-mismatched Baselin.pdf

大小:5.28 MB

只需: RMB 6 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群