-
CartPole, Pendulum, and LunarLander
The dataset used in the paper is a set of environments for reinforcement learning, including CartPole, Pendulum, and LunarLander. -
Cart-Pole, Pendulum, and Cart-Pole Balance Environments
The dataset used in this paper is a set of three classic control environments: cart-pole swing-up (CPSU), pendulum swing-up (PSU), and cart-pole balance (CPB). -
Pendulum and Reacher
The Pendulum swing-up task, the agent tries to keep the pendulum upright and balanced under the constraint of keeping away from unsafe angles. In the Reacher task, the robotic...