Visual Reasoning - Groups

MathVista

MathVista is a benchmark for evaluating mathematical reasoning in visual contexts.

Dataset
JSON

CoGenT

Inferring and executing programs for visual reasoning.

Dataset
JSON

COG

A dataset and architecture for visual reasoning with a working memory.

Dataset
JSON

VQA-CP

The VQA-CP dataset is a split of the VQA dataset, designed to test generalization skills across changes in the answer distribution between the training and the test sets.

Dataset
JSON

CLEVR-ART

The CLEVR-ART dataset is a novel dataset consisting of ART problems rendered using more realistic 3D shapes, based on the CLEVR dataset.

Dataset
JSON

Synthetic Visual Reasoning Test (SVRT)

The Synthetic Visual Reasoning Test (SVRT) dataset consists of 23 binary classification tasks, each defined by a particular configuration of relations. The tasks can be broadly...

Dataset
JSON

Abstract Reasoning Tasks (ART)

The Abstract Reasoning Tasks (ART) dataset was proposed by Webb et al. [51], consisting of four visual reasoning tasks, each defined by a different abstract rule. In the...

Dataset
JSON

Object-Centric Relational Abstraction

Human visual reasoning is characterized by an ability to identify abstract patterns from only a small number of examples, and to systematically generalize those patterns to...

Dataset
JSON

CLEVR-Robot Environment

A benchmark for evaluating task compositionality and long-horizon tasks through object manipulation, with language serving as the mechanism for goal specification.

Dataset
JSON

CLEVR dataset

The CLEVR dataset is a dataset for visual question answering, where each image is annotated with a question.

Dataset
JSON

Visual Transformation Telling (VTT)

The VTT dataset is a collection of instructional videos with annotated transformations.

Dataset
JSON

NLVR2

The dataset used in the paper is a set of sequential vision-and-language tasks, where each task consists of an image and a text input.

Dataset
JSON

Abstraction and Reasoning Corpus (ARC)

A collection of heterogeneous visual reasoning data sets and an interesting benchmark for two reasons: First, visual reasoning programs tend to be large (in current program...

Dataset
JSON

Discrete-Valued Neural Communication

The dataset used in the paper is a visual reasoning task using Graph Neural Networks (GNNs) and Recurrent Independent Mechanisms (RIMs). The dataset consists of 8 Atari games...

Dataset
JSON

Visual Genome

The Visual Genome dataset is a large-scale visual question answering dataset, containing 1.5 million images, each with 15-30 annotated entities, attributes, and relationships.

Dataset
JSON

15 datasets found