- Synthetic Visual Reasoning Test (SVRT)
The Synthetic Visual Reasoning Test (SVRT) dataset consists of 23 binary classification tasks, each defined by a particular configuration of relations. The tasks can be broadly...
- CLEVR-Robot Environment
A benchmark for evaluating task compositionality and long-horizon tasks through object manipulation, with language serving as the mechanism for goal specification.