Grid World

doi:doi:10.57702/smu7zene

Grid World

The dataset used in the paper is a reinforcement learning dataset, specifically a Markov Decision Process (MDP) with a finite set of states and actions.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Andrew Cohen, Lei Yu, Robert Wright (2025). Dataset: Grid World. https://doi.org/10.57702/smu7zene

DOI retrieved: January 2, 2025

Additional Info

Field	Value
Created	January 2, 2025
Last update	January 2, 2025
Defined In	https://doi.org/10.48550/arXiv.2109.07827
Citation	https://doi.org/10.48550/arXiv.1802.08331
Author	Andrew Cohen
More Authors	Lei Yu Robert Wright
Homepage	https://arxiv.org/abs/1806.03492