-
First-person hand benchmark (FHB)
The FHB dataset contains egocentric RGB-D videos of a wide range of hand-object interactions. -
Open Images Dataset
The dataset used in the experiment consists of 50 images equally distributed between five classes: aircraft, bird, bicycle, boat, and dog. Each class has 5 true positive images... -
iNaturalist dataset
The iNaturalist dataset is a crowdsourced compendium of living organisms, with fine- and coarse-grained species distinctions, a heavy-tailed class size distribution, and... -
ResNet and WRN datasets
ResNet and WRN datasets used for image classification tasks -
VAW dataset
The VAW dataset (Pham et al., 2021) is a large-scale visual attributes dataset with bounding box labels for the attribute annotations. -
CIFAR-100 and ImageNet-1k
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the CIFAR-100 and ImageNet-1k datasets for image classification and semantic... -
Arbitrary Style Transfer with Structure Enhancement by Combining the Global a...
Arbitrary style transfer generates an artistic image which combines the structure of a content image and the artistic style of the artwork by using only one trained network. -
BraTS 2020: self-ensembled, deeply-supervised 3D U-Net CNNs
Brain tumor segmentation is a critical task for patient’s disease management. In order to automate and standardize this task, we trained multiple U-net like neural networks,... -
Nested Hierarchical Transformer
The dataset used in the paper is not explicitly mentioned, but it is implied to be ImageNet and CIFAR-10/100. -
ImageNet and CIFAR-10 datasets
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used VGG-16, ResNet-50, and MobileNet-v2 models on the ImageNet and CIFAR-10... -
Dex-NeRF: Using a Neural Radiance Field to Grasp Transparent Objects
Synthetic datasets of transparent objects for training NeRF models. -
CAMELYON-17
CAMELYON-17 consists of 145 positive slides and 353 negative slides, where positive patches occupy less than 10% of the tissue area in positive slides. -
Distilled Feature Fields Enable Open-Ended Manipulation
The dataset used in the paper is a collection of RGB images of a tabletop scene, along with their corresponding camera poses and 3D geometry. -
Geometry Sharing Network for 3D Point Cloud Classification and Segmentation
Geometry Sharing Network (GS-Net) for 3D point cloud classification and segmentation -
Liver Lesion Classification
Liver lesion dataset for classification using synthetic data augmentation -
KITTI-360 dataset
The KITTI-360 dataset is an extension of the KITTI dataset, containing 10 new sequences recorded in 2013, with a focus on 360-degree views. -
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
The dataset used in the paper for image inpainting using Denoising Diffusion Probabilistic Models (DDPM).