-
Dual-Camera Smooth Zoom on Mobile Phones
The dataset is used for training and testing the proposed dual-camera smooth zoom (DCSZ) method. -
Weakly Supervised Gaussian Networks for Action Detection
Detecting temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision including frame-level labels. -
Pascal VOC Keypoints
The Pascal Visual Object Classes (VOC) challenge dataset consists of 20 image classes, where each image is parsed into an image graph using keypoints as nodes. -
Matching Map Recovery with an Unknown Number of Outliers
The dataset used in the paper consists of two sets of d-dimensional noisy feature vectors. -
Harbour Bridge
A dataset of tricamera stereo sequences for road-modeling in the presence of windscreen wipers. -
Tree Branch Dynamics for Manipulation
A dataset for learning the dynamic behavior of tree branches using a simulation-based parameter inference approach. -
IIRC-CIFAR
The IIRC-CIFAR dataset is a benchmark for evaluating models in the Incremental Implicitly-Refined Classification (IIRC) setup. -
RL-CZSL-ATTR and RL-CZSL-ACT
Two large-scale benchmark datasets for reference-limited compositional zero-shot learning (RL-CZSL). -
New Efficient Visual OILU Markers
The dataset is used to develop new efficient visual markers based on the OILU numbering system. -
Light Field Reconstruction from Focal Stack
A dataset for partially reconstructing high-resolution 4D light fields from a stack of differently focused photographs taken with a fixed camera. -
ShiftAddViT: Towards Efficient Vision Transformers
ShiftAddViT: A hardware-inspired multiplication-reduced Vision Transformer model. -
Room-to-Room (R2R) dataset
The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three... -
Toward a Visual Concept Vocabulary for GAN Latent Space
A new method for building open-ended vocabularies of primitive visual concepts represented in a GAN's latent space. -
Pointnet++: Deep hierarchical feature learning on point sets in a metric space
A hierarchical feature learning approach for 3D point cloud processing. -
Genetic Algorithm based hyper-parameters optimization for transfer Convolutio...
Hyperparameter optimization for transfer Convolutional Neural Networks (CNNs) using a genetic algorithm. -
Interiornet
Interiornet: Mega-scale multi-sensor photo-realistic indoor scenes dataset. -
Physics-aware Simulation for Object Detection and Pose Estimation
This paper proposes a dataset generation pipeline that uses physics simulation to generate images of objects in cluttered scenes. -
EuRoc micro aerial vehicle (MAV) datasets
The paper uses the EuRoc micro aerial vehicle (MAV) datasets. -
Rigidity Preserving Image Transformations and Equivariance in Perspective
The paper uses the LINEMOD and Occlusion LINEMOD datasets, which are benchmarks for 6D object pose estimation.