You're currently viewing an old version of this dataset. To see the current version, click here.

GridWorld

The dataset is used to demonstrate the effectiveness of the Discovery of Deep Options (DDO) algorithm in accelerating reinforcement learning.

Data and Resources

Cite this as

Roy Fox, Sanjay Krishnan, Ion Stoica, Ken Goldberg (2025). Dataset: GridWorld. https://doi.org/10.57702/rpou27c3

DOI retrieved: January 2, 2025

Additional Info

Field Value
Created January 2, 2025
Last update January 2, 2025
Defined In https://doi.org/10.1088/1367-2630/ab783c
Citation
  • https://doi.org/10.48550/arXiv.1307.3195
  • https://doi.org/10.48550/arXiv.1703.08294
Author Roy Fox
More Authors
Sanjay Krishnan
Ion Stoica
Ken Goldberg
Homepage https://arxiv.org/abs/1706.08415