3 datasets found

  • GridWorld

    The dataset is used to demonstrate the effectiveness of the Discovery of Deep Options (DDO) algorithm in accelerating reinforcement learning.
  • GridWorld and BlockDude Domains

    The GridWorld and BlockDude domains were used to evaluate the proposed task sequencing framework.
  • Gridworld Dataset

    The dataset used for the Gridworld tasks consists of 10K episodes of an agent following a uniform random policy.
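
As a rough illustration of what "10K episodes under a uniform random policy" can look like in practice, the sketch below rolls out such a policy in a toy grid and stores the transitions. The listing does not specify the environment, so the 5x5 grid, the start and goal cells, the sparse reward, the 50-step episode cap, and all function names are assumptions made for this example only, not details of the actual dataset.

```python
import random

# Hypothetical layout (not from the dataset description): a 5x5 grid,
# start in the top-left corner, goal in the bottom-right corner.
GRID_SIZE = 5
START = (0, 0)
GOAL = (GRID_SIZE - 1, GRID_SIZE - 1)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Move one cell; a move that would leave the grid keeps the agent in place."""
    row = min(max(state[0] + action[0], 0), GRID_SIZE - 1)
    col = min(max(state[1] + action[1], 0), GRID_SIZE - 1)
    next_state = (row, col)
    reward = 1.0 if next_state == GOAL else 0.0  # assumed sparse reward
    return next_state, reward, next_state == GOAL


def collect_episodes(num_episodes, max_steps=50, seed=0):
    """Roll out a uniform random policy and record (s, a, r, s') transitions."""
    rng = random.Random(seed)
    episodes = []
    for _ in range(num_episodes):
        state, episode = START, []
        for _ in range(max_steps):
            action_index = rng.randrange(len(ACTIONS))  # uniform over the 4 actions
            next_state, reward, done = step(state, ACTIONS[action_index])
            episode.append((state, action_index, reward, next_state))
            state = next_state
            if done:
                break
        episodes.append(episode)
    return episodes


dataset = collect_episodes(10_000)
```

Each episode is a list of `(state, action_index, reward, next_state)` tuples, so `dataset[0][0]` is the first transition of the first episode. Fixing the seed keeps the collected data reproducible across runs.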