-
Balanced Binary Neural Networks with Gated Residual
Binary neural networks have attracted numerous attention in recent years. However, mainly due to the information loss stemming from the biased binarization, how to preserve the... -
Visual Context-Aware Convolution Filters for Transformation-Invariant Neural ...
The proposed framework generates a unique set of context-dependent filters based on the input image, and combines them with max-pooling to produce transformation-invariant... -
KITTI and Ford Multi-AV Seasonal datasets
KITTI and Ford Multi-AV Seasonal datasets are used for training and evaluation of the proposed method. -
KITTI-CVL and FordAV-CVL datasets
Two cross-view localization datasets, KITTI-CVL and FordAV-CVL, are constructed by collecting spatial-consistent satellite counterparts from Google Map according to the provided... -
KITTI Benchmark Dataset
The KITTI benchmark dataset is used to evaluate the performance of the proposed method. The dataset contains large-scale outdoor sequences of images captured by a forward-facing... -
PartNetE Dataset
Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation -
ACRONYM Dataset
Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation -
DA2 Dataset
Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation -
Constrained Grasp Diffusion Fields
Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation -
D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Ma...
A task-agnostic play dataset of robot hand trajectories Dplay is collected per robot platform, allowing its reuse across multiple tasks. -
S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statis...
Face Anti-Spoofing (FAS) aims to detect malicious attempts to invade a face recognition system by presenting spoofed faces. State-of-the-art FAS techniques predominantly rely on... -
MegaDepth and NYU datasets
The dataset used in the paper is MegaDepth and NYU datasets for training and testing the proposed method. -
RSNA 2019 Brain CT Hemorrhage Challenge dataset
The RSNA 2019 Brain CT Hemorrhage Challenge dataset is a dataset of CT scans of the brain used for hemorrhage detection and classification. -
TPC-ViT: Token Propagation Controller for Efficient Vision Transformers
Vision transformers (ViTs) have achieved promising results on a variety of Computer Vision tasks, however their quadratic complexity in the number of input tokens has limited... -
Synthia 4D
The Synthia 4D dataset is a synthetic dataset for 4D semantic segmentation. -
RotDCF: Rotation-Equivariant Deep Networks
The paper proposes a decomposition of the convolutional filters over joint steerable bases across the space and the group geometry simultaneously, namely a rotation-equivariant... -
Unified Image and Video Saliency Modeling
The proposed model for unified image and video saliency analysis.