Dataset - LDM

Visual Question Answering (VQA)

The VQA dataset consists of 248,349 training questions, 121,512 validation questions and 244,302 testing questions, generated on a total of 123,287 images.
- Dataset
- JSON
MNRE-2

MNRE-2 dataset
- Dataset
- JSON
WikiText-103 and Enwik8 datasets

WikiText-103 and Enwik8 datasets are used for language modeling tasks
- Dataset
- JSON
Paper-Author

Paper-Author: This dataset contains papers crawled from the arXiv preprint database. Nodes U represent papers, while nodes V represent authors. An edge ⟨u, v⟩ indicates that the...
- Dataset
- JSON
AGNews

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a variety of datasets for semi-supervised learning tasks.
- Dataset
- JSON
Multimodal Attribute Extraction (MAE) dataset

The Multimodal Attribute Extraction (MAE) dataset is a large dataset containing mixed-media data for over 2.2 million commercial product items, collected from a large number of...
- Dataset
- JSON
EIT-1M

A large-scale multi-modal dataset comprising 1 million EEG-image-text pairs.
- Dataset
- JSON
Equity Evaluation Corpus (EEC)

The dataset used in the paper is the Equity Evaluation Corpus (EEC) for emotion prediction, which contains a balanced dataset of sentences with emotions.
- Dataset
- JSON
CLIPfa

The CLIPfa dataset is a multilingual image-text dataset.
- Dataset
- JSON
SemEval-2023 Task 1: Visual Word Sense Disambiguation

The SemEval-2023 Visual Word Sense Disambiguation (V-WSD) Task dataset consists of a silver dataset with 12,869 V-WSD instances. Each sample is a 4-tuple ⟨f, c, I, i∗ ∈ I⟩ where...
- Dataset
- JSON
SemEval-2017 Task 4

The SemEval-2017 Task 4 dataset consists of tweets with sentiment labels.
- Dataset
- JSON
Sherlock

The Sherlock dataset contains 103K images collected from the Visual Genome and Visual Common Sense Reasoning datasets. These images are split into 90K training, 6.6K validation,...
- Dataset
- JSON
VQA

The VQA dataset is a large-scale visual question answering dataset that consists of pairs of images that require natural language answers.
- Dataset
- JSON
Magazine

Magazine: This dataset contains Amazon Aeviews Data under the category of Magazine Subscriptions. We randomly sampled 100, 000 records and removed nodes with degrees lower than...
- Dataset
- JSON
OpenSubtitles dataset

Open-domain neural dialogue generation (Vinyals and Le, 2015; Sordoni et al., 2015; Li et al., 2016a; Mou et al., 2016; Serban et al., 2016a; Asghar et al., 2016; Mei et al.,...
- Dataset
- JSON
Schizophrenia Spectrum Dataset

The dataset used for this study was collected for a mental health assessment project conducted at the University of Maryland School of Medicine in collaboration with the...
- Dataset
- JSON
The KIT Motion-Language Dataset

The KIT Motion-Language Dataset consists of 3,911 motion sequences with 12.5 FPS and 6,278 language annotations.
- Dataset
- JSON
UCM

Remote sensing image-text retrieval dataset
- Dataset
- JSON
RSITMD

Remote sensing image-text retrieval dataset
- Dataset
- JSON
RSICD

The RSICD dataset is a benchmark remote sensing text-image dataset. It contains a total of 10921 aerial remote sensing images with various resolutions collected from Google...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

31 datasets found