Gridworld, Torus, and Four Rooms Environments

The dataset used in the paper consists of environments with different topological properties: a gridworld, a torus, and a four rooms environment. In each environment, the agent must navigate from a fixed starting location to a fixed goal location. Once it reaches the goal, it stays there for the remainder of the episode.
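
To make the difference between the bounded gridworld and the torus concrete, here is a minimal sketch of a single transition with an absorbing goal. It is not the code used to generate the dataset: the function name `step`, the four-action set, the square grid, and the (row, column) coordinates are all assumptions made for illustration. A four rooms variant would additionally block moves into wall cells, which is omitted here.

```python
from typing import Dict, Tuple

Position = Tuple[int, int]

# Hypothetical action set (an assumption): (row delta, column delta).
ACTIONS: Dict[str, Position] = {
    "up": (-1, 0),
    "down": (1, 0),
    "left": (0, -1),
    "right": (0, 1),
}


def step(pos: Position, action: str, size: int, goal: Position,
         torus: bool = False) -> Position:
    """Return the next position on a size x size grid (illustrative only)."""
    # The goal is absorbing: once reached, the agent stays there.
    if pos == goal:
        return goal

    dr, dc = ACTIONS[action]
    r, c = pos[0] + dr, pos[1] + dc

    if torus:
        # Torus: moving off one edge re-enters from the opposite edge.
        r, c = r % size, c % size
    else:
        # Bounded gridworld: moves into the boundary leave the agent in place.
        r = min(max(r, 0), size - 1)
        c = min(max(c, 0), size - 1)
    return (r, c)
```

Under these assumptions, `step((0, 0), "up", 5, (4, 4))` leaves the agent at the edge in the bounded grid, while the same call with `torus=True` wraps it to the bottom row.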

BibTeX: