计算视觉Simultaneously Learning Stochastic and Adversarial Episodic MDPs with K ...

Barda-2025

收藏 2025-08-10

Simultaneously Learning Stochastic and Adversarial
Episodic MDPs with Known Transition

      Tiancheng Jin                Haipeng Luo
   University of Southern California       University of Southern California
   tiancheng.jin@usc.edu             haipengl@usc.edu

                     Abstract
   This work studies the problem of learning episodic Markov Decision Processes
   with known transition and bandit feedback. We develop the first algorithm with a
   “best-o ...

附件列表

计算视觉Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Kn.pdf

大小:317.24 KB

只需: RMB 9 元马上下载

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

栏目导航

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群