Dataset - LDM

Visual Question Answering (VQA)

The VQA dataset consists of 248,349 training questions, 121,512 validation questions and 244,302 testing questions, generated on a total of 123,287 images.
- Dataset
- JSON
VQAv2 dataset

The VQAv2 dataset, containing open-ended questions on 265k images, with 5.4 questions per image on average.
- Dataset
- JSON
MNRE-2

MNRE-2 dataset
- Dataset
- JSON
Waymo Open Perception

The Waymo Open Perception dataset is a large-scale dataset for autonomous driving perception.
- Dataset
- JSON
MINIST

MINIST: This is a subset of the MINST hand-written digits dataset, created for the outlier detection task in Outlier Detection DataSets. It contains a total of 7603 images, with...
- Dataset
- JSON
Multimodal Attribute Extraction (MAE) dataset

The Multimodal Attribute Extraction (MAE) dataset is a large dataset containing mixed-media data for over 2.2 million commercial product items, collected from a large number of...
- Dataset
- JSON
Visual Genome Relationship Dataset

The Visual Genome Relationship Dataset contains 108,077 images and 1,531,448 relationships.
- Dataset
- JSON
Visual Relationship Dataset

The Visual Relationship Dataset contains 5000 images with 100 object categories and 70 predicates.
- Dataset
- JSON
EIT-1M

A large-scale multi-modal dataset comprising 1 million EEG-image-text pairs.
- Dataset
- JSON
CLIPfa

The CLIPfa dataset is a multilingual image-text dataset.
- Dataset
- JSON
SemEval-2023 Task 1: Visual Word Sense Disambiguation

The SemEval-2023 Visual Word Sense Disambiguation (V-WSD) Task dataset consists of a silver dataset with 12,869 V-WSD instances. Each sample is a 4-tuple ⟨f, c, I, i∗ ∈ I⟩ where...
- Dataset
- JSON
STERE

The dataset used in the paper for RGB-D salient object detection.
- Dataset
- JSON
NLPR

The dataset used in the paper for RGB-D salient object detection.
- Dataset
- JSON
STEX

Texture image dataset
- Dataset
- JSON
ALOT

Texture image dataset
- Dataset
- JSON
VisTex

VisTex dataset contains color texture images, representative of real world conditions.
- Dataset
- JSON
NIST SD27

The dataset used in the paper is a high-quality fingerprint image dataset (plain and rolled print) and a poor-quality fingerprint image dataset (latent print).
- Dataset
- JSON
Geometrical Illusions Dataset

The dataset is a collection of images used to study Geometrical illusions.
- Dataset
- JSON
Visual Cortex Dataset

The dataset is a collection of images used to study the visual cortex.
- Dataset
- JSON
Hyperspectral pasture image dataset

Hyperspectral pasture image dataset with imbalanced class distributions and disparate volumes of data among different sites
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

74 datasets found