Image-Text Pairs - Groups

Laion-20M

The dataset used for pre-training the MS-CLIP model, which consists of 20 million image-text pairs filtered from Laion-400M.
- Dataset
- JSON
W200M

The dataset used in this paper is a large-scale web sourced image-text paired dataset.
- Dataset
- JSON
LAION400M

The dataset used in this paper is a large-scale web sourced image-text paired dataset.
- Dataset
- JSON
CC3M

The dataset used in this paper is a large-scale web sourced image-text paired dataset.
- Dataset
- JSON
WebLI Dataset

The WebLI dataset used for training and evaluation of the CoBIT model.
- Dataset
- JSON
JFT-4B Dataset

The JFT-4B dataset used for training and evaluation of the CoBIT model.
- Dataset
- JSON
ALIGN Dataset

The ALIGN dataset used for training and evaluation of the CoBIT model.
- Dataset
- JSON
CoBIT Dataset

The dataset used for training and evaluation of the CoBIT model, which consists of image-text pairs from large-scale noisy web-crawled data and image annotation data.
- Dataset
- JSON

8 datasets found