-
DDAD dataset
The DDAD dataset is a new autonomous driving benchmark from Toyota Research Institute for long-range (up to 250m). -
MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity M...
Disparity prediction from stereo images is essential to computer vision applications including autonomous driving, 3D model reconstruction, and object detection. -
Cityscapes
The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and... -
KITTI dataset
The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding... -
ShapeNetCore
The ShapeNetCore dataset is a large-scale 3D model dataset, containing 44,000 3D models and 13 categories. -
CIFAR-10, CIFAR-100, and ImageNet
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used CIFAR-10, CIFAR-100, and ImageNet datasets. -
Bollywood dataset
The Bollywood dataset is a collection of images of Bollywood celebrities with varying body mass indexes (BMIs). The dataset is used for face-to-BMI prediction. -
Microsoft COCO
The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and... -
ImageNet Large Scale Visual Recognition Challenge
A benchmark for low-shot recognition was proposed by Hariharan & Girshick (2017) and consists of a representation learning phase without access to the low-shot classes and a... -
KITTI 2015
The KITTI 2015 dataset is a real-world dataset of street views, containing 200 training stereo image pairs with sparsely labeled disparity from LiDAR data. -
Scene Flow
Stereo matching aims to recover the dense reconstruction of unknown scenes by computing the disparity from rectified stereo images, helping robots intelligently interact with... -
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters
The dataset used in this paper is a CNN training dataset, specifically VGG-16, VGG-19, and AlexNet. -
FusionT-LESS
Sensor fusion can significantly improve the performance of many computer vision tasks. However, traditional fusion approaches are either not data-driven and cannot exploit prior... -
FusionCelebA
Sensor fusion can significantly improve the performance of many computer vision tasks. However, traditional fusion approaches are either not data-driven and cannot exploit prior... -
FusionMNIST
Sensor fusion can significantly improve the performance of many computer vision tasks. However, traditional fusion approaches are either not data-driven and cannot exploit prior...