Binary Tree MDP

The dataset used in the paper is a binary tree MDP, where the agent must execute a sequence of L uninterrupted UP movements. The dataset is used to test the Successor Uncertainties algorithm.

BibTex: