-
MSCOCO dataset
The MSCOCO dataset is a large-scale image captioning dataset, containing 113,287 images with 5,000 validation images and 5,000 test images. The dataset is used for training and... -
ProgS-RCNN
ProgS-RCNN: Progressive End-to-End Object Detection in Crowded Scenes -
CrowdHuman
CrowdHuman is a challenging benchmark to evaluate the ability of crowded scene detection of detectors, which contains about 15k training images and 4k images for evaluation. -
VGG-16 Dataset
The VGG-16 dataset is a large collection of images of objects. -
COCO test-dev
The COCO test-dev dataset is used for instance segmentation. It contains 20k test-dev images. -
PASCAL VOC 2007 dataset
PASCAL VOC 2007 dataset is a widely used dataset for object detection and semantic segmentation. We use all the split sets (training, validation, testing) in the VOC2007 dataset... -
Objects365
The Objects365 dataset is a large-scale object detection dataset containing 365,000 images with 365 categories. -
LabelMe dataset
The LabelMe dataset is a natural scene dataset used for testing the performance of the IBTM model on image classification tasks. -
VOC-Mixture
The dataset used in the paper for semi-supervised object detection tasks. -
COCO-Standard
The dataset used in the paper for semi-supervised object detection tasks. -
VIRAT dataset
A video dataset for frame duplication detection and localization in forged videos