5 datasets found

Tags: annotations

  • BLIP2

    BLIP2 is a vision-language pre-training dataset consisting of 100 million image-text pairs.
  • COD10K

    COD10K is currently the largest challenging dataset for camouflaged object detection (COD), containing 10K images with dense annotations.
  • DeepFashion2

    DeepFashion2 is a large-scale fashion image benchmark with comprehensive tasks and annotations. It contains 491K images, each of which is richly labeled with style, scale,...
  • KITTI dataset

    The KITTI dataset, as used in the paper, is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...
  • COCO

    Large-scale datasets [18, 17, 27, 6] have boosted the quality of text-conditional image generation. However, in some domains it can be difficult to build such datasets, and usually it could...
You can also access this registry using the API (see API Docs).