-
Microsoft COCO 2017 dataset
This dataset contains images paired with multiple human-annotated descriptions in the form of sentences. -
icon45 dataset
The icon45 dataset is used for object detection and recognition. -
CityPersons and CrowdHuman
CityPersons and CrowdHuman are two benchmark datasets for pedestrian detection. CityPersons contains 35k person and 13k ignore region annotations, while CrowdHuman contains 340k... -
A Parallel Implementation of Computing Mean Average Precision
Mean Average Precision (mAP) has been widely used for evaluating the quality of object detectors, but an efficient implementation is still absent. Current implemen- -
Shapes and Shapes Rotation
The dataset used in the paper is a collection of sequences with patterns of different shapes and speeds, and a new dataset collected with the Baxter robot in a manipulation task... -
Depth Estimation and Object Detection
The dataset used for depth estimation and object detection. -
TinyPersons
The TinyPersons dataset is a small-scale dataset for object detection, consisting of images of tiny humans. -
VisDrone-MOT
The VisDrone-MOT dataset is a large-scale benchmark for multiple object tracking under drone scenes. -
Microsoft COCO object detection dataset
Microsoft COCO object detection dataset -
PASCAL VOC 2007 and PASCAL VOC 2012 object detection datasets
PASCAL VOC 2007 and PASCAL VOC 2012 object detection datasets -
ALFA: Agglomerative Late Fusion Algorithm for Object Detection
ALFA: Agglomerative Late Fusion Algorithm for Object Detection -
EPIC-KITCHENS
EPIC-KITCHENS is a large-scale egocentric video benchmark recorded by 32 participants in their native kitchen environments. Our videos depict non-scripted daily activities: we... -
BDD100K MOTS
The BDD100K MOTS dataset is a subset of the BDD100K dataset, containing 154 videos with annotation for training and validation, and 37 videos for testing. -
Densely Annotated Video Segmentation (DAVIS)
The Davis dataset contains fifty high-resolution videos with pixel-accurate ground truth.