-
Mountain Car
The dataset used in the paper is a reinforcement learning dataset, specifically a Markov Decision Process (MDP) with a finite set of states and actions. -
Mountain Car, Acrobot, and Gridworld
The dataset used in the paper is a reinforcement learning dataset, specifically the Mountain Car and Acrobot problems, and a Gridworld problem. -
Mountain Car and 4-dimensional Catcher
The dataset used in this paper is a reinforcement learning dataset, specifically the Mountain Car and 4-dimensional Catcher environments. -
OpenAI Gym
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used several continuous control environments from the OpenAI Gym.