Inverted Pendulum

The dataset used in the paper is an Inverted Pendulum dataset, which is a standard benchmark system in control and reinforcement learning.

BibTex: