- Bipedal Walker, Acrobot, and Continuous Lunar Lander tasks
The paper uses reinforcement learning benchmark tasks, specifically Bipedal Walker, Acrobot, and Continuous Lunar Lander.
- Mountain Car, Acrobot, and Gridworld
The paper uses reinforcement learning benchmark problems, specifically Mountain Car, Acrobot, and a Gridworld problem.
- OpenAI Gym
The paper does not explicitly describe a dataset, but it mentions that the authors used several continuous control environments from OpenAI Gym.