Computer Vision - Groups

KITTI tracking dataset

The KITTI tracking dataset provides 21 training and 29 test sequences. It includes 2D bounding box annotations for cars, pedestrians, and 6 other classes, but only the...

Dataset
JSON

Physics-aware Simulation for Object Detection and Pose Estimation

This paper proposes a dataset generation pipeline that uses physics simulation to generate images of objects in cluttered scenes.

Dataset
JSON

Imagenette

The Imagenette dataset is used in the paper to study class density and dataset quality in high-dimensional, unstructured data.

Dataset
JSON

Caltech

The Caltech dataset is a benchmark for pedestrian detection, consisting of approximately 10 hours of video taken from a vehicle driving through Los Angeles.

Dataset
JSON

VIMER-UFO Benchmark

The VIMER-UFO benchmark covers 8 computer vision tasks, each represented by one dataset: CPLFW, Market1501, DukeMTMC, MSMT-17, Veri-776, VehicleId, VeriWild, and SOP.

Dataset
JSON

Content-Aware Convolutional Neural Networks

Convolutional Neural Networks (CNNs) have achieved great success due to the powerful feature learning ability of convolution layers. Specifically, the standard convolution...

Dataset
JSON

Places205

Places205 is a dataset of 2.5 million images from 205 categories, averaging about 12,000 images per category.

Dataset
JSON

Open Images Dataset

The dataset used in the experiment consists of 50 images distributed equally among five classes: aircraft, bird, bicycle, boat, and dog. Each class has 5 true positive images...

Dataset
JSON

Training dataset generation for bridge game registration

The proposed method of automatic dataset generation for card detection and classification makes it possible to obtain any number of images of any size, which can be used to...

Dataset
JSON

Argoverse: 3D tracking and forecasting with rich maps

The Argoverse dataset includes 65 training and 24 validation sequences recorded in Miami and Pittsburgh.

Dataset
JSON

ImageNet Large Scale Visual Recognition Challenge (ILSVRC)

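The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification benchmark drawn from ImageNet. The full ImageNet collection contains over 14 million images from 21,841 categories; the ILSVRC classification subset covers 1,000 of those categories, with roughly 1.2 million training images.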

Dataset
JSON

Amazon Picking Challenge 2016 Dataset

This dataset was used in the Amazon Picking Challenge 2016 by a vision-based robotic picking system developed by Team Applied Robotics.

Dataset
JSON

ScanNet200

Diff2Scene uses ScanNet, Matterport3D, ScanNet200 and Replica for open-vocabulary 3D semantic segmentation and visual grounding tasks.

Dataset
JSON

PASCAL Visual Object Classes Challenge

The PASCAL Visual Object Classes Challenge (VOC) is a benchmark dataset for object detection and semantic segmentation.

Dataset
JSON

COCO object detection and instance segmentation, ADE20K semantic segmentation

The datasets used in the paper are COCO, for object detection and instance segmentation, and ADE20K, for semantic segmentation.

Dataset
JSON

Object Tracking Benchmark

The OTB100 dataset is an extension of the OTB50 dataset, containing 100 videos with 1000 frames each.

Dataset
JSON

MIT-67, CUB-2011, Caltech-101, DTD

MIT-67 is a dataset of 67 indoor scenes, CUB-2011 is a dataset of 200 bird species, Caltech-101 is a dataset of 101 objects, and DTD is a dataset of 47 textures.

Dataset
JSON

V-COCO

The V-COCO dataset contains 2,533 training images, 2,867 validation images, and 4,946 test images, including 24 action classes.

Dataset
JSON

Submanifold Sparse Convolutional Networks

Convolutional networks are the de facto standard for analysing spatio-temporal data such as images, videos, 3D shapes, etc. Whilst some of this data is naturally dense (for...

Dataset
JSON

MS COCO dataset

The MS COCO dataset is a large benchmark for image captioning, containing 328K images with 5 caption descriptions each.

Dataset
JSON

42 datasets found