全部版块 我的主页
论坛 提问 悬赏 求职 新闻 读书 功能一区 经管文库(原现金交易版)
102 0
2025-07-27
PsiPhi-Learning: Reinforcement Learning with Demonstrations using
      Successor Features and Inverse Temporal Difference Learning

    Angelos Filos 1 Clare Lyle 1 Yarin Gal 1 Sergey Levine 2 Natasha Jaques * 2 3 Gregory Farquhar * 4

              Abstract
   We study reinforcement learning (RL) with no-
   reward demonstrations, a setting in which an RL
   agent has access to additional data from the inter-
   action of other agents with the same environment.
   However, it has no access to  ...
附件列表
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群