Dataset - LDM

NeRF

NeRF [33] has demonstrated an amazing ability to synthesize images of 3D scenes from novel views. However, it relies upon specialized volumetric rendering algorithms based on ray...
- Dataset
- JSON
DomainNet

The dataset used in the paper is a cross-domain dataset, consisting of six domains: Real, Painting, Sketch, Clipart, Infograph, and Quickdraw. Each domain contains 345 object...
- Dataset
- JSON
SCUT-HEAD Dataset

The SCUT-HEAD dataset is a head detection dataset containing images with varying scales and poses.
- Dataset
- JSON
Visual Wake Words Dataset

The Visual Wake Words dataset is a binary classification dataset for detecting the presence of a person in an image.
- Dataset
- JSON
ImageNet-10 Dataset

The ImageNet-10 dataset is a subset of the ImageNet-1K dataset, containing images from 10 classes.
- Dataset
- JSON
WIDER FACE Dataset

The WIDER FACE dataset is a face detection dataset containing images with varying scales, poses, and occlusions.
- Dataset
- JSON
Vision-based Target Pose Estimation with Multiple Markers for the Perching of...

A vision-based target pose estimation method using multiple markers for high-precision nano drone perching at both wide and close ranges.
- Dataset
- JSON
MS-COCO

Large-scale datasets [18, 17, 27, 6] boosted text-conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
- Dataset
- JSON
dsprites: Disentanglement testing sprites dataset

- Dataset
- JSON
MegaDepth

Feature matching is a fundamental problem for many computer vision tasks, such as object recognition, structure from motion, and simultaneous localization and mapping.
- Dataset
- JSON
DIV2K

Single Image Super-Resolution (SR) aims to generate a High Resolution (HR) image I_SR from a low resolution (LR) image I_LR such that it is similar to the original HR image I_HR....
- Dataset
- JSON
LSUN

The dataset is used for training and validating the proposed approach to combining semantic segmentation and dense outlier detection.
- Dataset
- JSON
CLIP

The CLIP model and its variants are becoming the de facto backbone in many applications. However, training a CLIP model from hundreds of millions of image-text pairs can be...
- Dataset
- JSON
DDAD dataset

The DDAD dataset is a new autonomous driving benchmark from Toyota Research Institute for long-range (up to 250 m) depth estimation.
- Dataset
- JSON
MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity M...

Disparity prediction from stereo images is essential to computer vision applications including autonomous driving, 3D model reconstruction, and object detection.
- Dataset
- JSON
Cityscapes

The Cityscapes dataset is a large and well-known semantic segmentation dataset of city street scenes. 19 of the dataset's 30 classes are considered for training and...
- Dataset
- JSON
KITTI dataset

The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...
- Dataset
- JSON
ShapeNetCore

The ShapeNetCore dataset is a large-scale 3D model dataset, containing 44,000 3D models in 13 categories.
- Dataset
- JSON
CIFAR-10, CIFAR-100, and ImageNet

The paper does not describe its data explicitly, but it mentions that the authors used the CIFAR-10, CIFAR-100, and ImageNet datasets.
- Dataset
- JSON
Bollywood dataset

The Bollywood dataset is a collection of images of Bollywood celebrities with varying body mass indexes (BMIs). The dataset is used for face-to-BMI prediction.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

1,082 datasets found