-
MNIST data set
The MNIST data set is a dataset used for testing the performance of the DisDF. -
Caltech-UCSD Birds
Caltech-UCSD Birds (CUB 200-2007) and extended version CUB 200-2011 image collections tagged with keypoints, bounding boxes, coarse segmentation, and attribute labels. -
Imagenette
The Imagenette dataset used in the paper for class density and dataset quality in high-dimensional, unstructured data. -
Scattering Networks for Hybrid Representation Learning
Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modeling images. -
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transfo...
Semantic segmentation is a fundamental task in computer vision and enables many downstream applications. It is related to image classification since it produces per-pixel... -
ImageNet1K and ImageNet21K for image classification, and MS COCO for object d...
The dataset used in the paper is ImageNet1K and ImageNet21K for image classification, and MS COCO for object detection. -
Open Images Dataset
The dataset used in the experiment consists of 50 images equally distributed between five classes: aircraft, bird, bicycle, boat, and dog. Each class has 5 true positive images... -
Nested Hierarchical Transformer
The dataset used in the paper is not explicitly mentioned, but it is implied to be ImageNet and CIFAR-10/100. -
STL-10 dataset
The dataset used in this paper is a collection of images from the STL-10 dataset, preprocessed and used for training and evaluation of the proposed diffusion spectral entropy... -
ImageNet Large Scale Visual Recognition Challenge (ILSVRC)
The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification dataset containing over 14 million images from 21,841 categories. -
PACS dataset
The dataset used in the paper is a large collection of small images, each representing a patch of a jigsaw puzzle. The patches are of the same size and orientation, and the goal... -
Waterbirds
Dataset bias is a significant problem in training fair classifiers. When attributes unrelated to classification exhibit strong biases towards certain classes, classifiers... -
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
VisionLLaMA is a unified and generic modeling framework for solving most vision tasks. -
ImageNet: A Large-Scale Hierarchical Image Database
The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories. -
ImageNet-1000
The dataset used in this paper is ImageNet-1000 pre-trained CNNs.