-
DIODE and KITTI
Datasets used for the monocular depth estimation task -
FlyingThings3D dataset
The FlyingThings3D dataset is a benchmark for stereo matching, consisting of a large collection of images and corresponding disparity maps. -
KITTI 2015 dataset
The KITTI 2015 dataset contains videos of 200 street scenes captured by RGB cameras, with sparse depth ground truth captured by a Velodyne laser scanner. -
NYUv2 dataset
The NYUv2 dataset is a large-scale dataset for 3D object recognition and semantic segmentation. It contains 206 test set video sequences with 135 classes. -
CityScapes dataset
Monocular depth estimation dataset -
ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth ...
Depth estimation is a crucial step for 3D reconstruction with panorama images in recent years. Panorama images maintain the complete spatial information but introduce distortion... -
Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction
TransDepth is a proposed framework for pixel-wise prediction problems involving continuous labels. -
DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax
DepthP+P: A method for self-supervised monocular depth estimation using planar and parallax. -
METER: a mobile vision transformer architecture for monocular depth estimation
Monocular depth estimation is a fundamental knowledge for autonomous systems that need to assess their own state and perceive the surrounding environment. -
Mono-ViFI: A Unified Framework for Self-supervised Monocular Depth Estimation
Self-supervised monocular depth estimation has gathered notable interest since it can liberate training from dependency on depth annotations. In the monocular video training case,... -
Joint Prediction of Monocular Depth and Structure using Planar and Parallax G...
The datasets used in the paper are the KITTI Vision Benchmark and Cityscapes, for monocular depth estimation and structure prediction. -
KITTI dataset
The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...