-
Flickr1024
Stereo image super-resolution aims to improve the quality of high-resolution stereo image pairs by exploiting complementary information across views. -
Training-free Object Counting with Prompts
Object counting in images -
MPI Sintel
The dataset used in the paper for unsupervised single image intrinsic decomposition. -
Middlebury
The Middlebury dataset is a benchmark for stereo vision and 3D reconstruction. -
ImageNet-Sketch
ImageNet-Sketch is used as target dataset for domain adaptation. -
SVHN, MNIST, and MNIST-M
SVHN, MNIST, and MNIST-M are used as source datasets for domain adaptation. -
CIFAR-10-C, CIFAR-100-C, and ImageNet-C
CIFAR-10-C, CIFAR-100-C, and ImageNet-C are used as target datasets for corruption robustness evaluation. -
Common corruptions and perturbations for evaluating robustness
Common corruptions and perturbations are used to evaluate the robustness of neural networks. -
Robustifying Vision Transformer without Retraining from Scratch
Vision Transformer (ViT) is becoming more popular in image processing. We investigate the effectiveness of test-time adaptation (TTA) on ViT, a technique that has emerged to... -
Benchmarking neural network robustness to common corruptions and perturbations
Benchmarking neural network robustness to common corruptions and perturbations. -
Waste Classification using Computer Vision and Deep Learning
Dataset for waste classification using computer vision and deep learning -
Pascal VOC
Semantic segmentation is a crucial and challenging task for image understanding. It aims to predict a dense labeling map for the input image, which assigns each pixel a unique... -
dsprites: Disentanglement testing sprites dataset
dsprites: Disentanglement testing sprites dataset -
Cityscapes
The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and... -
KITTI dataset
The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...