Grid World

The dataset used in the paper is a reinforcement learning dataset, specifically a Markov Decision Process (MDP) with a finite set of states and actions.

Data and Resources

Cite this as

Andrew Cohen, Lei Yu, Robert Wright (2025). Dataset: Grid World. https://doi.org/10.57702/smu7zene

DOI retrieved: January 2, 2025

Additional Info

Field Value
Created January 2, 2025
Last update January 2, 2025
Defined In https://doi.org/10.48550/arXiv.2109.07827
Citation
  • https://doi.org/10.48550/arXiv.1802.08331
Author Andrew Cohen
More Authors
Lei Yu
Robert Wright
Homepage https://arxiv.org/abs/1806.03492