Learning Generalized Policy Automata for Relational Stochastic Shortest Path Problems
The dataset used in this paper is a set of small SSP instances with small object counts, used to learn Generalized Policy Automata (GPAs) for solving larger related SSPs.
BibTex: