-
Self-driving Dataset
Self-driving dataset is a dataset of pedestrians in various environments. -
LiDAR Localizability Estimation Dataset
The dataset used for training and testing the proposed localizability estimation approach. -
NRMVS: Non-Rigid Multi-View Stereo
A dataset for non-rigid multi-view stereo (NRMVS) reconstruction from sparse, unordered RGB images with non-rigid changes. -
3D Movie Camera Dataset
The dataset is used for the 3D movie camera implementation. It contains 3D data acquired using the Flying Triangulation method. -
MIT Intrinsic Images Dataset
A large-scale object non-Lambertian intrinsics database based on ShapeNet, a large-scale 3D shape dataset. -
ShapeNet-Intrinsics Dataset
A large-scale object non-Lambertian intrinsics database based on ShapeNet, a large-scale 3D shape dataset. -
Unsupervised Deep Single-Image Intrinsic Decomposition
The dataset used for training and testing the proposed deep single-image intrinsic decomposition model. -
3D Shape Coverage Estimation Dataset
The dataset used in this paper is a 3D shape coverage estimation dataset. -
Imagenette
The Imagenette dataset used in the paper for class density and dataset quality in high-dimensional, unstructured data. -
Subway Station Pedestrian Dataset
A dataset for pedestrian counting in subway surveillance videos -
vHeat: Building Vision Models upon Heat Conduction
A fundamental problem in learning robust and expressive visual representations lies in efficiently estimating the spatial relationships of visual semantics throughout the entire... -
MPII Human Pose Dataset
Human pose estimation refers to the task of recognizing postures by localizing body keypoints (head, shoulders, elbows, wrists, knees, ankles, etc.) from images. -
COCO, ADE20K, PASCAL Context, and LVIS datasets
COCO dataset, ADE20K dataset, PASCAL Context dataset, LVIS dataset -
Tetrahedron Splatting for 3D Generation
The dataset used in the paper for 3D generation using TeT-Splatting. -
Independent Sign Language Recognition with 3D Body, Hands, and Face Reconstru...
Independent Sign Language Recognition is a complex visual recognition problem that combines several challenging tasks of Computer Vision due to the necessity to exploit and fuse... -
Skin Cancer MNIST (HAM10000) dataset
The Skin Cancer MNIST (HAM10000) dataset is a good use case to assess the capabilities of attention mechanisms in neural networks. -
Dataset Distillation by Automatic Training Trajectories
Dataset Distillation by Automatic Training Trajectories -
Wide-area image geolocalization with aerial reference imagery
The CVUSA and CVACT datasets are used for cross-view geolocalization. The VIGOR dataset is used for cross-view image retrieval and 3-DoF pose estimation. -
C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval an...
The CVUSA and CVACT datasets are used for cross-view geolocalization. The VIGOR dataset is used for cross-view image retrieval and 3-DoF pose estimation.