Binary Tree MDP

This dataset is a binary tree MDP in which the agent must execute a sequence of L uninterrupted UP movements. It is used in the paper to test the Successor Uncertainties algorithm.
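
The task described above can be illustrated with a minimal Python sketch. It is not the authors' code. The class name `BinaryTreeMDP`, the action constants, and the `run_episode` helper are hypothetical, and the dynamics are assumptions made for illustration: any DOWN action ends the episode with zero reward, and a reward of 1 is given only after L consecutive UP actions.

```python
from dataclasses import dataclass

# Hypothetical action encoding; the original code may use a different one.
UP, DOWN = 0, 1


@dataclass
class BinaryTreeMDP:
    """Illustrative environment of depth L (assumed dynamics, see lead-in)."""

    L: int
    depth: int = 0
    done: bool = False

    def reset(self) -> int:
        """Return the agent to the root of the tree."""
        self.depth = 0
        self.done = False
        return self.depth

    def step(self, action: int):
        """Apply one action and return (state, reward, done)."""
        if self.done:
            raise RuntimeError("episode finished; call reset() first")
        if action == UP:
            self.depth += 1
            # Assumption: the episode ends with reward 1 after L UP moves.
            self.done = self.depth == self.L
            reward = 1.0 if self.done else 0.0
        else:
            # Assumption: a single DOWN move ends the episode unrewarded.
            self.done = True
            reward = 0.0
        return self.depth, reward, self.done


def run_episode(env: BinaryTreeMDP, actions) -> float:
    """Play the given actions until the episode ends; return total reward."""
    env.reset()
    total = 0.0
    for action in actions:
        _, reward, done = env.step(action)
        total += reward
        if done:
            break
    return total
```

Under these assumed dynamics, a policy that picks actions uniformly at random collects the reward with probability 2^-L, which is why the task is a useful test of directed exploration.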

Cite this as

David Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek (2024). Dataset: Binary Tree MDP. https://doi.org/10.57702/jhkedzvk

DOI retrieved: December 3, 2024

Additional Info

Field: Value
Created: December 3, 2024
Last update: December 3, 2024
Author: David Janz
More Authors: Jiri Hron, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek
Homepage: https://djanz.org/successor_uncertainties/tabular_code