No Organization - Organizations

KITTI Vision Benchmark Suite

The KITTI Vision Benchmark Suite is a dataset used for object detection and tracking in autonomous vehicles.
- Dataset
- JSON
Zero-1-to-3

Zero-1-to-3: Zero-shot one image to 3D object.
- Dataset
- JSON
Position Embedding Needs an Independent Layer Normalization

The dataset used in the paper is not explicitly described, but it is mentioned that the authors analyzed the input and output of each encoder layer in Vision Transformers (VTs)...
- Dataset
- JSON
DINO dataset

The DINO dataset: A large-scale vision transformer dataset
- Dataset
- JSON
Caltech-UCSD Birds 200

The Caltech-256 object category dataset is used for the feature extraction step, and the Omniglot dataset is used for the evaluation.
- Dataset
- JSON
Middlebury

The Middlebury dataset is a benchmark for stereo vision and 3D reconstruction.
- Dataset
- JSON
StereoDiffusion

StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models
- Dataset
- JSON
Shapenet

Shapenet is a large-scale synthesis 3D object dataset, where we follow [9] to use the official test splits of chair, car, and motorbike categories for evaluation since they...
- Dataset
- JSON
YOLOv2

The dataset used in the paper is a publicly available dataset for object detection.
- Dataset
- JSON
XCiT: Cross-Covariance Image Transformers

Following tremendous success in natural language processing, transformers have re-
- Dataset
- JSON
VoxelNet

The VoxelNet dataset is a large-scale dataset for 3D object detection, consisting of 3D point clouds and corresponding annotations.
- Dataset
- JSON
KITTI Benchmark Suite

The KITTI benchmark suite is a large-scale dataset for 3D object detection, consisting of 7,481 training samples and 7,518 test samples.
- Dataset
- JSON
DeepFashion3D

DeepFashion3D is a dataset for 3D garment reconstruction from single images.
- Dataset
- JSON
ImageNet-Compatible and CIFAR-10 datasets

The authors used the ImageNet-Compatible and CIFAR-10 datasets for targeted attack experiments.
- Dataset
- JSON
Reading digits in natural images with unsupervised feature learning

The paper presents a method for reading digits in natural images using unsupervised feature learning.
- Dataset
- JSON
Image COCO

The Image COCO 3 dataset’s image caption annotations, where we sample 4 10,000 sentences as training set and another 10,000 as test set.
- Dataset
- JSON
Selecting Receptive Fields in Deep Networks

The authors used the CIFAR-10 dataset for evaluating the quality of unsupervised representation learning algorithms.
- Dataset
- JSON
Deep Convolutional Generative Adversarial Networks

The authors used three datasets: Large-scale Scene Understanding (LSUN), Imagenet-1k, and a newly assembled Faces dataset.
- Dataset
- JSON
Facial Makeup Transfer Dataset

This dataset is used for facial makeup transfer.
- Dataset
- JSON
GraspNet-1Billion

GraspNet-1Billion is a large-scale real-world grasping dataset containing 190 cluttered grasping scenes and 97,280 RGB-D images captured by 2 kinds of RGB-D cameras from 256...
- Dataset
- JSON

1,082 datasets found