-
DataComp-1B
The dataset used in the paper is also DataComp-1B, which is a large-scale dataset for training next-generation image-text models. -
LAION-400M and LAION-5B
The dataset used in the paper is LAION-400M and LAION-5B, which are large-scale datasets for training next-generation image-text models. -
DataCompDR
The dataset used for CLIP pretraining with good quality captions.