OpenAI Gym’s MuJoCo benchmark

The dataset used in this paper is a set of reinforcement learning demonstrations that contains both safe and unsafe trajectories.

BibTeX: