Mountain Car, Acrobot, and Gridworld

The dataset used in the paper is a reinforcement learning dataset covering three problems: Mountain Car, Acrobot, and a Gridworld.

BibTeX: