-
Efficient CNN with uncorrelated Bag of Features
The proposed approach is evaluated on three different datasets: MNIST, fashionMNIST, and CIFAR10. -
Epic-Kitchens VISOR Benchmark
Egocentric video dataset for object detection and segmentation -
Kaolin-Wisp Dataset
The dataset used in the paper is the Kaolin-Wisp dataset, which is a benchmark for neural fields research. -
Middlebury 2014
The Middlebury 2014 dataset is a benchmark for stereo matching, consisting of 33 pairs of stereo images with sparse depth ground truth. -
VIMER-UFO Benchmark
The VIMER-UFO benchmark dataset consists of 8 computer vision tasks: CPLFW, Market1501, DukeMTMC, MSMT-17, Veri-776, VehicleId, VeriWild, and SOP. -
SD-Measure: A Social Distancing Detector
The proposed framework for detecting social distancing from video footage -
Synthetic Fisheye Dataset for Fisheye Images
A synthetic fisheye dataset based on the ImageNet-1K, constructed to explore the performance of Transformer models on fisheye images. -
Scattering Networks for Hybrid Representation Learning
Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modeling images. -
SCGM dataset
The dataset used for training and testing the proposed deep co-training method for semi-supervised image segmentation. -
LLFF, Replica, and ScanNet datasets
The dataset used in the paper to evaluate the proposed FG-NeRF method for uncertainty estimation in neural radiance fields. -
Deep Epitomic Convolutional Neural Networks
Deep convolutional neural networks have recently proven extremely competitive in challenging image recognition tasks. This paper proposes the epitomic convolution as a new... -
Real-World Fisheye Image Rectification Dataset
The dataset is used for evaluating the performance of fisheye image rectification algorithms. It contains real-world fisheye images with their corresponding ground truth images... -
Synthetic Fisheye Image Rectification Dataset
The dataset is used for fisheye image rectification. It contains synthetic images with fisheye distortion and their corresponding ground truth images, distortion parameters, and... -
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across...
The dataset used in the paper is a large-scale comparison of pretrained models across computer vision tasks. -
Multi-step Pick and Place
The Multi-step Pick and Place dataset is a collection of images and labels used for training a visual representation that can bridge the sim2real visual gap. -
Stack Object
The Stack Object dataset is a collection of images and labels used for training a visual representation that can bridge the sim2real visual gap. -
Lang4Sim2Real
The Lang4Sim2Real dataset is a collection of image-language pairs used for training a visual representation that can bridge the sim2real visual gap.