A new challenge named Visual Text Question Answering (VTQA) is introduced, along with a corresponding dataset that includes 23,781 questions based on 10,124 image-text pairs.
The dataset used in the paper is YFCC100M, a large-scale multimedia dataset of images and videos. It is used for foreground and background patch extraction and for object recognition tasks.