3 datasets found

Tags: pre-training

Filter Results
  • DataComp-1B

    The dataset used in the paper is also DataComp-1B, which is a large-scale dataset for training next-generation image-text models.
  • LAION-400M and LAION-5B

    The dataset used in the paper is LAION-400M and LAION-5B, which are large-scale datasets for training next-generation image-text models.
  • CLIP

    The CLIP model and its variants are becoming the de facto backbone in many applications. However, training a CLIP model from hundreds of millions of image-text pairs can be...