-
CartPole-v1, LunarLander-v2, Maze Traversal
CartPole-v1, LunarLander-v2, Maze Traversal -
OpenAI Gym
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used several continuous control environments from the OpenAI Gym.