The LoCA Regret A Consistent Metric to Evaluate Model-Based Behavior in Rei ...

Kaka-2030

收藏 2025-08-08

The LoCA Regret: A Consistent Metric to Evaluate
  Model-Based Behavior in Reinforcement Learning

   Harm van Seijen1 , Hadi Nekoei2 , Evan Racah2 , Sarath Chandar2,3,4
         1
            Microsoft Research Montréal, 2 Mila - Quebec AI Institute,
         3
         cole Polytechnique de Montréal, 4 Canada CIFAR AI Chair

                     Abstract
   Deep model-based Reinforcement Learning (RL) has the potential to substantially
   improve the sample-efficiency of ...

附件列表

The LoCA Regret A Consistent Metric to Evaluate Model-Based Behavior in Reinfor.pdf

大小:886.87 KB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群