-
Multi-Stage Gridworld
The dataset used in the paper is the Multi-Stage Gridworld environment, which is a 3D environment that requires the agent to navigate through a gridworld to find a goal. -
Treasure Mountain
The dataset used in the paper is the Treasure Mountain environment, which is a 2D environment that requires the agent to navigate through a mountain to find a treasure.