-
3DCoMPaT++
3DCoMPaT++: An improved Large-scale 3D Vision Dataset for Compositional Recognition. -
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Point-BERT is a new paradigm for learning point cloud Transformers. It pre-trains standard point cloud Transformers with a Masked Point Modeling (MPM) task. -
KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D...
A dataset for urban scene understanding in 2D and 3D. -
The KITTI Vision Benchmark Suite
A benchmark suite for 3D vision tasks. -
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
The dataset used in the paper is a multi-view image dataset, where each view is a 2D image of a 3D scene. The dataset is used to evaluate the performance of the Lift3D method,... -
3D Vision with Transformers: A Survey
The dataset is a comprehensive review of over 100 transformer methods for different 3D vision tasks, including classification, segmentation, detection, completion, pose... -
KITTI dataset
The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...