Grid World

Grid World

The dataset used in the paper is a reinforcement learning dataset, specifically a Markov Decision Process (MDP) with a finite set of states and actions.
- Dataset
- JSON
Grid-world task

The dataset used in this paper is the Grid-world task, which is a simple grid-based environment. The dataset is used to evaluate the performance of the Self-correcting...
- Dataset
- JSON

Before browse our site, please accept our cookies policy

2 datasets found