-
Depth Estimation from Monocular Images and Sparse Radar Data
The dataset is used for depth estimation from monocular images and sparse radar data. -
Depth Map Super-Resolution
Depth map super-resolution (DMSR) is a practical and valuable computer vision task. DMSR requires upscaling a low-resolution (LR) depth map into a high-resolution (HR) space. -
PASCAL Visual Object Classes Challenge
The PASCAL Visual Object Classes Challenge (VOC) is a benchmark dataset for object detection and semantic segmentation. -
Towards Metrical Reconstruction of Human Faces
A dataset for metrical reconstruction of human faces. -
Multiview Face Capture using Polarized Spherical Gradient Illumination
A dataset for capturing 3D faces using polarized spherical gradient illumination. -
A 3D Morphable Model of Craniofacial Shape and Texture Variation
A 3D morphable model of craniofacial shape and texture variation. -
FitDiff: Robust monocular 3D facial shape and reflectance estimation using Di...
A diffusion-based 3D facial generative model conditioned on identity embeddings. -
Conformer: Local Features Coupling Global Representations
Conformer is a dual network structure that combines CNN-based local features with transformer-based global representations for enhanced representation learning. -
Scannet: Richly-annotated 3D reconstructions of indoor scenes
Scannet: Richly-annotated 3D reconstructions of indoor scenes. -
Google Scanned Objects Dataset
The Google Scanned Objects dataset is a collection of over one thousand 3D-scanned household items. The dataset is used for evaluation of the proposed ConsistNet model. -
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI ...
Human-Object Interaction (HOI) detection is a significant task to make a machine understand human activities in a static image at a fine-grained level. -
CIFAR10, CIFAR100, SVHN, ImageNet
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used four widely used datasets: CIFAR10, CIFAR100, SVHN, and ImageNet. -
Perspective Crop Layers (PCLs)
Local processing is an essential feature of CNNs and other neural network architectures—it is one of the reasons why they work so well on images where relevant information is,... -
DRIVE Dataset
The DRIVE dataset includes 40 color fundus photographs divided into two parts: 20 training images and 20 test images with manual segmentation of the vessels and binary masks of... -
Instance-Aware Graph Convolutional Network for Multi-Label Classification
Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task by introducing label dependencies based on statistical label... -
Microsoft Bing Images API
Aerial images of London. -
Google Street View API
Street View images of London.