Bipedal Walker, Acrobot, and Continuous Lunar Lander tasks
The dataset used in this paper is a reinforcement learning benchmark problem, specifically the Bipedal Walker, Acrobot, and Continuous Lunar Lander tasks.
BibTex:
Before browse our site, please accept our cookies policy