20 datasets found

  • Corel5k

    The dataset used in this paper for image annotation, consisting of 4,999 annotated images with a vocabulary of up to 200 keywords.
  • SPMDataset

    A dataset of images annotated with semantic tuples, including predicates, actors, and locatives.
  • Microsoft COCO 2017 dataset

    This dataset contains images paired with multiple human-annotated descriptions in the form of sentences.
  • Heudiasyc dataset

    A dataset for autonomous driving.
  • ApolloScape Dataset

    The ApolloScape dataset is a large-scale dataset for autonomous driving, containing images and annotations.
  • BLIP2

    A vision-language pre-training dataset of 100 million image-text pairs.
  • Inria Aerial Image Labeling dataset

    Inria Aerial Image Labeling dataset contains aerial orthorectified color imagery of 5000 × 5000 pixels with a spatial resolution of 0.3 m.
  • AICrowd Mapping Challenge dataset

    AICrowd Mapping Challenge dataset contains 300 × 300 pixels RGB images and corresponding annotations in MS-COCO format.
  • MSRC

    A multi-view clustering dataset containing 2,000 samples, each represented in 6 views. The dataset is used to evaluate the performance of the proposed...
  • ReferItGame

    A benchmark for visual grounding, the task of localizing a language query in an image; the output is typically a bounding box.
  • Flickr30K Entities

    The Flickr30K Entities dataset consists of 31,783 images each matched with 5 captions. The dataset links distinct sentence entities to image bounding boxes, resulting in 70K...
  • Broden

    The dataset used in the paper is Broden, a dataset containing pixel-level concept annotations.
  • Pothole Detection Dataset

    A dataset of images with pothole annotations from various sources, including Google Earth Pro, AUTOPILOT videos, and GoPro camera images.
  • RefCOCOg

    The RefCOCOg dataset is built on top of MS-COCO, containing 85,474 referring expressions for 54,822 objects in 26,711 images.
  • RefCOCO

    The dataset used in the paper is a benchmark for referring expression grounding, containing 142,210 referring expressions for 50,000 referents in 19,994 images.
  • LabelMe dataset

    The LabelMe dataset is a natural scene dataset used for testing the performance of the IBTM model on image classification tasks.
  • COCO Dataset

    The COCO dataset is a large-scale dataset for object detection, semantic segmentation, and captioning. It contains 80 object categories and 1,000 image instances per category,...
  • Visual Genome

    The Visual Genome dataset is a large-scale dataset connecting vision and language, containing over 100,000 images, each densely annotated with entities, attributes, and relationships.
  • Cityscapes

    The Cityscapes dataset is a large and widely used city street scene semantic segmentation dataset; 19 of its 30 annotated classes are considered for training and...
  • COCO

    Large-scale datasets [18, 17, 27, 6] have boosted text-conditional image generation quality. However, in some domains it can be difficult to build such datasets, and usually it could...
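Several of the datasets above (e.g. the AICrowd Mapping Challenge, COCO, and the RefCOCO variants) ship annotations in MS-COCO JSON format. As a minimal sketch of how that format is typically consumed, the toy dictionary below mirrors the standard COCO layout (`images`, `annotations`, `categories`; the field names and values here are hypothetical examples, not taken from any dataset above):

```python
# Toy, in-memory COCO-style annotation structure (hypothetical example
# mirroring the MS-COCO JSON layout: images, annotations, categories).
coco = {
    "images": [
        {"id": 1, "file_name": "000001.jpg", "width": 300, "height": 300},
    ],
    "annotations": [
        # bbox follows the COCO convention: [x, y, width, height]
        {"id": 10, "image_id": 1, "category_id": 100, "bbox": [50, 60, 120, 80]},
    ],
    "categories": [
        {"id": 100, "name": "building"},
    ],
}

def annotations_for(coco_dict, image_id):
    """Return (category_name, bbox) pairs for one image,
    resolving category ids via the categories table."""
    names = {c["id"]: c["name"] for c in coco_dict["categories"]}
    return [
        (names[a["category_id"]], a["bbox"])
        for a in coco_dict["annotations"]
        if a["image_id"] == image_id
    ]

print(annotations_for(coco, 1))  # → [('building', [50, 60, 120, 80])]
```

In practice the same structure is read from a JSON file (e.g. via `json.load`) or through the `pycocotools` API, but the id-resolution step shown here is the core of working with COCO-format annotations.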