RGB-D scene recognition approaches often train two standalone backbones for RGB and depth modalities with the same Places or ImageNet pre-training. However, the pre-trained...
The dataset used in the paper is a real-world 3D point cloud dataset, which is used for 3D shape classification, part segmentation, and shape retrieval tasks.