Corridor Environment
The corridor environment is a simple environment where the agent has to determine whether the rewarding cell (colored yellow) is at the top or bottom, based on the color of the cell it has seen at the start (either ”blue” or ”red”).
BibTex: