-
Constrained Grasp Diffusion Fields
Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation -
D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Ma...
A task-agnostic play dataset of robot hand trajectories Dplay is collected per robot platform, allowing its reuse across multiple tasks. -
S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statis...
Face Anti-Spoofing (FAS) aims to detect malicious attempts to invade a face recognition system by presenting spoofed faces. State-of-the-art FAS techniques predominantly rely on... -
MegaDepth and NYU datasets
The dataset used in the paper is MegaDepth and NYU datasets for training and testing the proposed method. -
RSNA 2019 Brain CT Hemorrhage Challenge dataset
The RSNA 2019 Brain CT Hemorrhage Challenge dataset is a dataset of CT scans of the brain used for hemorrhage detection and classification. -
TPC-ViT: Token Propagation Controller for Efficient Vision Transformers
Vision transformers (ViTs) have achieved promising results on a variety of Computer Vision tasks, however their quadratic complexity in the number of input tokens has limited... -
Synthia 4D
The Synthia 4D dataset is a synthetic dataset for 4D semantic segmentation. -
RotDCF: Rotation-Equivariant Deep Networks
The paper proposes a decomposition of the convolutional filters over joint steerable bases across the space and the group geometry simultaneously, namely a rotation-equivariant... -
Unified Image and Video Saliency Modeling
The proposed model for unified image and video saliency analysis. -
Efficient CNN with uncorrelated Bag of Features
The proposed approach is evaluated on three different datasets: MNIST, fashionMNIST, and CIFAR10. -
Epic-Kitchens VISOR Benchmark
Egocentric video dataset for object detection and segmentation -
Kaolin-Wisp Dataset
The dataset used in the paper is the Kaolin-Wisp dataset, which is a benchmark for neural fields research. -
Middlebury 2014
The Middlebury 2014 dataset is a benchmark for stereo matching, consisting of 33 pairs of stereo images with sparse depth ground truth. -
VIMER-UFO Benchmark
The VIMER-UFO benchmark dataset consists of 8 computer vision tasks: CPLFW, Market1501, DukeMTMC, MSMT-17, Veri-776, VehicleId, VeriWild, and SOP. -
SD-Measure: A Social Distancing Detector
The proposed framework for detecting social distancing from video footage -
Synthetic Fisheye Dataset for Fisheye Images
A synthetic fisheye dataset based on the ImageNet-1K, constructed to explore the performance of Transformer models on fisheye images. -
Scattering Networks for Hybrid Representation Learning
Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modeling images.