Dataset - LDM

BLIP2

A vision-language pre-training dataset, BLIP2, which consists of 100 million image-text pairs.
- Dataset
- JSON
COD10K

The COD10K dataset is currently the largest challenging dataset for COD, containing 10K images with dense annotations.
- Dataset
- JSON
DeepFashion2

DeepFashion2 is a large-scale fashion image benchmark with comprehensive tasks and annotations. It contains 491K images, each of which is richly labeled with style, scale,...
- Dataset
- JSON
KITTI dataset

The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...
- Dataset
- JSON
COCO

Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

5 datasets found