-
MuJoCo Benchmark
The paper uses the MuJoCo benchmark, a collection of simulated robotic control tasks.
-
OpenAI Gym
The paper does not explicitly describe its dataset, but it states that the authors used several continuous control environments from OpenAI Gym.