- CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks
  3D Convolutional Neural Networks (CNNs) have been widely applied to 3D scene understanding, such as video analysis and volumetric image recognition.
- Places dataset
  The Places dataset is a large-scale dataset for scene recognition, containing more than 1 million images across 365 scene categories.
- Visual Genome
  The Visual Genome dataset is a large-scale vision-and-language dataset used for tasks such as visual question answering. It contains about 108,000 images, each densely annotated with objects, attributes, and relationships.
- Common Objects in 3D
  Common Objects in 3D: Large-scale learning and evaluation of real-life 3D category reconstruction
- Google Scanned Objects
  Google Scanned Objects is a dataset of real scanned 3D objects. We use all 1030 of its samples for evaluation, unlike existing works [29, 31], which use only 30 of them.
- Objaverse: A universe of annotated 3D objects
  A large-scale dataset of 3D objects for training and testing 3D reconstruction models.