1 dataset found

Tags: reinforcement learning

Filter Results
  • Binary Tree MDP

    The dataset used in the paper is a binary tree MDP, where the agent must execute a sequence of L uninterrupted UP movements. The dataset is used to test the Successor...