1 dataset found

Groups: Image Annotation
Organizations: No Organization
Formats: JSON

  • BLIP2

    BLIP2 is a vision-language pre-training dataset consisting of 100 million image-text pairs.