-
Synthetic Visual Reasoning Test (SVRT)
The Synthetic Visual Reasoning Test (SVRT) dataset consists of 23 binary classification tasks, each defined by a particular configuration of relations. The tasks can be broadly... -
Abstract Reasoning Tasks (ART)
The Abstract Reasoning Tasks (ART) dataset was proposed by Webb et al. [51], consisting of four visual reasoning tasks, each defined by a different abstract rule. In the... -
Object-Centric Relational Abstraction
Human visual reasoning is characterized by an ability to identify abstract patterns from only a small number of examples, and to systematically generalize those patterns to... -
CLEVR-Robot Environment
A benchmark for evaluating task compositionality and long-horizon tasks through object manipulation, with language serving as the mechanism for goal specification. -
CLEVR dataset
The CLEVR dataset is a dataset for visual question answering, where each image is annotated with a question. -
Visual Transformation Telling (VTT)
The VTT dataset is a collection of instructional videos with annotated transformations. -
Abstraction and Reasoning Corpus (ARC)
A collection of heterogeneous visual reasoning data sets and an interesting benchmark for two reasons: First, visual reasoning programs tend to be large (in current program... -
Discrete-Valued Neural Communication
The dataset used in the paper is a visual reasoning task using Graph Neural Networks (GNNs) and Recurrent Independent Mechanisms (RIMs). The dataset consists of 8 Atari games... -
Visual Genome
The Visual Genome dataset is a large-scale visual question answering dataset, containing 1.5 million images, each with 15-30 annotated entities, attributes, and relationships.