-
Corridor Environment
The corridor environment is a simple environment where the agent has to determine whether the rewarding cell (colored yellow) is at the top or bottom, based on the color of the... -
Reconnaissance Blind TicTacToe
The Reconnaissance Blind TicTacToe (RBT) dataset is a variation of the Reconnaissance Blind Chess (RBC) challenge. It is a game of TicTacToe where the agent cannot see the moves... -
POAPS Program for Iterative Improvement
A POAPS program for iterative improvement on descriptions for images -
POAPS Program for Labeling
A POAPS program for labeling that manages uncertainty without exposing it to the user -
A Programming Language With a POMDP Inside
A programming language with a POMDP inside