Gridworld Dataset

The dataset used for the Gridworld tasks, consisting of 10K episodes of the agent following a uniform random policy.

BibTex: