Dataset - LDM

ILSVRC2012

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a subset of the validation dataset used for the ImageNet Large Scale Visual...
- Dataset
- JSON
PASCAL VOC 2007

Multi-label image recognition is a practical and challenging task compared to single-label image classification.
- Dataset
- JSON
Pascal Visual Object Classes (VOC) Challenge

The Pascal Visual Object Classes (VOC) challenge is a benchmark for object detection and segmentation.
- Dataset
- JSON
3D-MNIST

A dataset of 3D shapes, consisting of 30,000 images of 6,196 objects.
- Dataset
- JSON
Reading digits in natural images with unsupervised feature learning

The paper presents a method for reading digits in natural images using unsupervised feature learning.
- Dataset
- JSON
SVHN Dataset

The dataset used in the paper is a collection of images from the SVHN dataset, along with labels. The dataset is used for image classification.
- Dataset
- JSON
VGG-16 Dataset

The VGG-16 dataset is a large collection of images of objects.
- Dataset
- JSON
ILSVRC

ILSVRC is a large-scale image dataset containing over 1.2 million images across 1,000 classes.
- Dataset
- JSON
MNIST Database

The MNIST database of handwritten digits is a popular benchmark data set for classiﬁcation algorithms.
- Dataset
- JSON
Multiscale Vision Transformers

Multiscale Vision Transformers (MViT) for video and image recognition, by connecting the seminal idea of multiscale feature hierarchies with transformer models.
- Dataset
- JSON
ImageNet Dataset

Object recognition is arguably the most important problem at the heart of computer vision. Recently, Barbu et al. introduced a dataset called ObjectNet which includes objects in...
- Dataset
- JSON
CIFAR-10 Dataset

The dataset used in this paper is a neural network, and the authors used it to test the performance of their lookahead pruning method.
- Dataset
- JSON
OpenImages

Large-scale vision-and-language models trained on curated and web-scrapped data have led to significant improvements over task-specific models when transferred to downstream...
- Dataset
- JSON
MS-COCO

Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
- Dataset
- JSON
LSUN

The dataset used for training and validation of the proposed approach to combine semantic segmentation and dense outlier detection.
- Dataset
- JSON
LVIS

Instance segmentation (IS) is an important computer vision task, aiming at simultaneously predicting the class label and the binary mask for each instance of interest in an image.
- Dataset
- JSON
An image is worth 16x16 words: Transformers for image recognition at scale

An image is worth 16x16 words: Transformers for image recognition at scale.
- Dataset
- JSON
Microsoft COCO

The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and...
- Dataset
- JSON
ImageNet Large Scale Visual Recognition Challenge

A benchmark for low-shot recognition was proposed by Hariharan & Girshick (2017) and consists of a representation learning phase without access to the low-shot classes and a...
- Dataset
- JSON
FFHQ

Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

63 datasets found