-
FlyingThings3D dataset
The FlyingThings3D dataset is a benchmark for stereo matching, consisting of a large collection of images and corresponding disparity maps. -
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism
A weakly-supervised depth estimation network using attention mechanism for cases with wrong labels. -
MegaDepth dataset
The dataset used for training the D2-Net model, consisting of 327,036 image pairs with at least 50% overlap in the sparse SfM point cloud. -
Symmetry-awareDepthEstimationusingDeepNeuralNetworks
The dataset for Symmetry-awareDepthEstimationusingDeepNeuralNetworks, containing product images with reflection symmetry. -
Deep Depth From Focus
Depth from focus (DFF) is a highly ill-posed inverse problem because the optimal focal distance is inferred from sharpness measures which fail in untextured areas. Existing... -
Depth Estimation from Monocular Images and Sparse Radar Data
The dataset is used for depth estimation from monocular images and sparse radar data. -
NYUv2 dataset
The NYUv2 dataset is a large-scale dataset for 3D object recognition and semantic segmentation. It contains 206 test set video sequences with 135 classes. -
NYU-Depth V2
The NYU-Depth V2 dataset contains pairs of RGB and depth images collected from Microsoft Kinect in 464 indoor scenes. -
DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax
DepthP+P: A method for self-supervised monocular depth estimation using planar and parallax. -
MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model
MonoDiffusion: A novel self-supervised monocular depth estimation framework by reformulating it as an iterative denoising process. -
METER: a mobile vision transformer architecture for monocular depth estimation
Monocular depth estimation is a fundamental knowledge for autonomous systems that need to assess their own state and perceive the surrounding environment. -
KITTI 2012
KITTI 2012 is a real-world dataset in the outdoor scenario, and contains 194 training and 195 testing stereo image pairs with the size of 376 × 1240. -
Joint Prediction of Monocular Depth and Structure using Planar and Parallax G...
The dataset used in the paper is the KITTI Vision Benchmark and Cityscapes dataset for monocular depth estimation and structure prediction. -
Cityscapes
The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and... -
KITTI dataset
The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...