344 datasets found

Tags: Computer Vision

  • KITTI-360 dataset

    The KITTI-360 dataset is an extension of the KITTI dataset, containing 10 new sequences recorded in 2013, with a focus on 360-degree views.
  • CIFAR-10-C and ImageNet-C

    The datasets used in the paper are CIFAR-10-C and ImageNet-C, which are common-corruption benchmarks.
  • PRW

    Person search unifies person detection and person re-identification (Re-ID) to locate query persons from the panoramic gallery images. One major challenge comes from the...
  • CUHK-SYSU

    CUHK-SYSU is a large-scale dataset designed for person search, containing 18,184 scene images captured from street snaps and movie screenshots.
  • AWA2 (Animals with Attributes 2)

    The AWA2 dataset includes 50 classes of assorted animals totaling 37,322 samples, of which 10 categories are considered unseen classes. Attribute annotations are 85-dimensional.
  • SUN (SUN Attribute)

    The SUN dataset covers 717 scene categories totaling 14,340 images, of which 72 categories are unseen classes. Attribute annotations are 102-dimensional.
  • CUB (Caltech UCSD Birds 200)

    The CUB dataset comprises 200 bird species totaling 11,788 image samples, of which 50 categories are held out as unseen classes.
  • STL-10 dataset

    The dataset used in this paper is a collection of images from the STL-10 dataset, preprocessed and used for training and evaluation of the proposed diffusion spectral entropy...
  • CIFAR100, ImageNet100, and ImageNet

    The datasets used in the paper are CIFAR100, ImageNet100, and ImageNet. CIFAR100 consists of 100 object classes and 60,000 images. ImageNet100 has 100 object classes and 60,000...
  • Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field

    The dataset consists of four synthetic rendered scenes and four real captured scenes. The synthetic scenes provide control and ground truth for evaluation, while the real...
  • DRIVE

    The DRIVE dataset contains curvilinear retinal vessel structures. It consists of 40 color retinal images of 565 × 584 pixels, split into 20 training images and 20 test images.
  • Mobile AI Workshop 2021

    A dataset for the Mobile AI Workshop 2021.
  • Caltech Pedestrian

    The dataset used in the paper is a video prediction dataset with occlusions, which is used to evaluate the proposed Fast Fourier Inception Networks (FFINet) for occluded video...
  • ImageNet Large Scale Visual Recognition Challenge (ILSVRC)

    The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification benchmark drawn from ImageNet, which contains over 14 million images from 21,841 categories.
  • Image Inpainting

    The CelebAHQ dataset was used with a fixed removal mask located near the image centers [11].
  • ETH3D Benchmark

    The ETH3D Benchmark dataset contains the 3D coordinates of a set of objects, images in which these objects can be seen, the intrinsic parameters and the pose of each of the cameras...
  • DINOv2

    The dataset used in the paper is DINOv2, a vision foundation model trained on a large-scale dataset.
  • PACS dataset

    The dataset used in the paper is a large collection of small images, each representing a patch of a jigsaw puzzle. The patches are of the same size and orientation, and the goal...
  • MPI3D dataset

    The dataset used in the paper is the MPI3D dataset, which contains 3D images of objects with varying sizes and colors.
  • Shapes3D dataset

    The dataset used in the paper is the Shapes3D dataset, which contains 3D shapes with varying sizes and colors.
You can also access this registry using the API (see API Docs).