8 datasets found

Filter Results
  • Laion-20M

    The dataset used for pre-training the MS-CLIP model, which consists of 20 million image-text pairs filtered from Laion-400M.
  • W200M

    The dataset used in this paper is a large-scale web sourced image-text paired dataset.
  • LAION400M

    The dataset used in this paper is a large-scale web sourced image-text paired dataset.
  • CC3M

    The dataset used in this paper is a large-scale web sourced image-text paired dataset.
  • WebLI Dataset

    The WebLI dataset used for training and evaluation of the CoBIT model.
  • JFT-4B Dataset

    The JFT-4B dataset used for training and evaluation of the CoBIT model.
  • ALIGN Dataset

    The ALIGN dataset used for training and evaluation of the CoBIT model.
  • CoBIT Dataset

    The dataset used for training and evaluation of the CoBIT model, which consists of image-text pairs from large-scale noisy web-crawled data and image annotation data.