CartPole, Pendulum, and LunarLander

The paper uses a set of reinforcement learning environments as its dataset, including CartPole, Pendulum, and LunarLander.

BibTeX: