-
AS-MLP: Axial Shifted MLP Architecture for Vision
An Axial Shifted MLP architecture (AS-MLP) is proposed for vision. Different from MLP-Mixer, where the global spatial feature is encoded for information flow through matrix... -
User-Controllable Latent Transformer for StyleGAN Image Layout Editing
The dataset used in the paper for user-controllable latent code transformation for StyleGAN image layout editing. -
Pacific Graphics 2022
The dataset used in the paper for user-controllable latent code transformation for StyleGAN image layout editing. -
Going Deeper with Convolutions
The dataset used for training and testing the proposed method. -
ChestX-ray8
A hospital-scale chest X-ray database, namely “ChestX-ray8”, which comprises 108,948 frontal-view X-ray images of 32,717 unique patients with the text-mined eight common disease... -
Learning compact representations for LiDAR completion and generation
Learning compact representations for LiDAR completion and generation. -
LidarDM: Generative LiDAR Simulation in a Generated World
LidarDM: A novel layout-conditioned latent diffusion model for generating realistic LiDAR point clouds. -
Occ3D-nuScenes
Occupancy prediction plays a pivotal role in au- -
Graph Matching
Graph matching (GM) constitutes a pervasive problem in computer vision and pattern recognition, with applications in image registration, recognition, stereo, 3D shape matching,... -
Contextual Convolution
Contextual convolution (CoConv) for visual recognition. CoConv is a direct replacement of the standard convolution that can be used at any stage in CNN architectures. -
EuRoC MAV Dataset
A dataset for visual odometry, containing 11 sequences with provided ground-truth poses. -
Flying Chairs
The dataset used in the paper is not explicitly described, but it is mentioned that the authors applied their approach to the challenging problem of optical flow estimation and... -
CNN Model Dataset
The dataset used in this paper is a dataset of four CNN models: ResNet-18, Vgg-16, Squeezenet v1.0, and AlexNet. -
Acoustic AVSpeech
The Acoustic AVSpeech dataset is a benchmark for visual acoustic matching. -
SoundSpaces-Speech
The SoundSpaces-Speech dataset is a benchmark for visual acoustic matching. -
Oxford Radar RobotCar dataset
The Oxford Radar RobotCar dataset is a radar extension to the Oxford RobotCar dataset, containing radar observations of a vehicle in various scenarios. -
VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads
A large-scale synthetic dataset for human head detection and 3D mesh estimation.