-
PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer
Recent Transformer-based 3D object detectors learn point cloud features either from point- or voxel-based representations. -
Discrete-Valued Neural Communication
The dataset used in the paper is a visual reasoning task using Graph Neural Networks (GNNs) and Recurrent Independent Mechanisms (RIMs). The dataset consists of 8 Atari games... -
Training Transformers to Perform Tasks
A dataset for training transformers to perform tasks such as language translation and text generation. -
3D Vision with Transformers: A Survey
The dataset is a comprehensive review of over 100 transformer methods for different 3D vision tasks, including classification, segmentation, detection, completion, pose... -
An image is worth 16x16 words: Transformers for image recognition at scale
An image is worth 16x16 words: Transformers for image recognition at scale.