GridWorld

The GridWorld dataset is used to demonstrate the effectiveness of the Discovery of Deep Options (DDO) algorithm in accelerating reinforcement learning.

BibTeX: