Sample-Efficient Reinforcement Learning of Undercomplete POMDPs
Chi Jin
Princeton University
chij@princeton.edu

Sham M. Kakade
University of Washington
Microsoft Research, NYC
sham@cs.washington.edu

Akshay Krishnamurthy
Microsoft Research, NYC
akshaykr@microsoft.com

Qinghua Liu
Princeton University
qinghual@princeton.edu
...