Dataset - LDM

ImageNet with Adversarial Text Regions

The ImageNet with Adversarial Text Regions (ImageNet-Atr) dataset is a new evaluation set built by adding spotting words to images from the ImageNet evaluation sets.
- Dataset
- JSON

YFCC15M-V2

The dataset is used for Contrastive Language-Image Pretraining (CLIP) and its variants.
- Dataset
- JSON

YFCC15M-V1

The dataset is used for Contrastive Language-Image Pretraining (CLIP) and its variants.
- Dataset
- JSON

YFCC15M

Mid-scale 15M data offers a good balance between training cost and performance. The dataset is used for Contrastive Language-Image Pretraining (CLIP) and its variants.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).
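
As a rough illustration of working with registry records programmatically, the sketch below parses a JSON list of dataset entries and filters it by keyword. The payload is an inline sample built from the four entries on this page. The field names (`name`, `description`) and the helper functions `load_registry` and `find_datasets` are assumptions made for this example, not the registry's documented schema or API. Consult the API Docs for the actual endpoints and response format.

```python
import json

# Hypothetical payload: the "name" and "description" fields are assumed for
# illustration and may not match the registry's real JSON schema.
SAMPLE_REGISTRY_JSON = """
[
  {"name": "ImageNet-Atr",
   "description": "ImageNet evaluation images with adversarial text regions."},
  {"name": "YFCC15M-V2",
   "description": "Used for CLIP and its variants."},
  {"name": "YFCC15M-V1",
   "description": "Used for CLIP and its variants."},
  {"name": "YFCC15M",
   "description": "Used for CLIP and its variants."}
]
"""


def load_registry(payload: str) -> list[dict]:
    """Parse a JSON array of dataset records (hypothetical helper)."""
    records = json.loads(payload)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of dataset records")
    return records


def find_datasets(records: list[dict], keyword: str) -> list[str]:
    """Return names of records whose name or description contains keyword."""
    needle = keyword.lower()
    matches = []
    for record in records:
        name = record.get("name", "")
        description = record.get("description", "")
        if needle in name.lower() or needle in description.lower():
            matches.append(name)
    return matches


records = load_registry(SAMPLE_REGISTRY_JSON)
print(len(records))
print(find_datasets(records, "yfcc"))
```

In practice the inline sample would be replaced by whatever the registry API returns, and the field names adjusted to match its actual response.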
