Neuro-algorithmic Policies Enable Fast Combinatorial Generalization
Marin Vlastelica 1 , Michal Rolínek 1 Georg Martius 1
input representation Dijkstra's shortest path predicted Hamming expert
t+2
2 frames learning t+1 trajectory distance trajectory
t
C
?
cost- c1
p ...
附件列表