Bipedal Walker, Acrobot, and Continuous Lunar Lander tasks

The data used in this paper comes from three reinforcement learning benchmark tasks: Bipedal Walker, Acrobot, and Continuous Lunar Lander.

BibTeX: