1 dataset found

Tags: human images

  • BLIP2

    BLIP2 is a vision-language pre-training dataset consisting of 100 million image-text pairs.

You can also access this registry using the API (see API Docs).