-
Neural scene flow fields for space-time view synthesis of dynamic scenes
Neural scene flow fields for space-time view synthesis of dynamic scenes. -
Nerfies: Casual Free-Viewpoint Selfies
Deformable Neural Radiance Fields extend NeRF by modeling non-rigidly deforming scenes. -
Dirty Cityscapes
A dataset of 10k images with artificially generated soiling patterns, used for training and testing the soiling detection model. -
Dirty WoodScape
A companion dataset to the WoodScape dataset, containing 10k images with artificially generated soiling patterns. -
Labeled Faces in the Wild (LFW) dataset
Labeled Faces in the Wild (LFW) dataset consists of 5749 subjects each having different number of images ranging from 1 to 530. -
DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion Models
DiffPose: A conditional diffusion model for multi-hypothesis human pose estimation from a single image. -
Shadow Detection Datasets
The dataset used in this paper for shadow detection, consisting of 4 widely used benchmark datasets: SBU, UCF, ISTD, and CUHK. -
ExplainFix: Explainable Spatially Fixed Deep Networks
ExplainFix adopts two design principles: the “fixed filters” principle that all spatial filter weights of convolutional neural networks can be fixed at initialization and never... -
Lumiere: A Space-Time Diffusion Model for Video Generation
A dataset for video generation and video-based tasks. -
Rectified Binary Convolutional Networks for Enhancing the Performance of 1-bit...
The proposed rectified binary convolutional networks (RBCNs) are used to improve the performance of 1-bit DCNNs for mobile and AI chips based applications. -
Image Enhancement for Adverse Images
This paper uses the ImageNet and COCO2017 validation datasets for testing. -
MobileViGv2
MobileViGv2 uses Mobile Graph Convolution (MGC) to demonstrate the effectiveness of our approach. -
PointConv: Deep Convolutional Networks on 3D Point Clouds
3D point clouds are irregular and unordered, hence applying convolution on them can be difficult. PointConv can be applied on point clouds to build deep convolutional networks. -
YCB-Video dataset
The YCB-Video dataset contains 92 videos of 21 objects with varying textures and sizes under cluttered indoor environments. -
LINEMOD-OCCLUSION dataset
The LMO dataset is a subset of the LM dataset consisting of eight objects in more cluttered scenes. -
LINEMOD dataset
The LM consists of 13 objects with approximately 1.2K images per object. We follow the settings described in [2], which uses 15% of the data for training and the rest for testing. -
ImageNet Large Scale Visual Recognition Challenge 2012
This dataset is used to evaluate the performance of a Convolutional Neural Network (CNN) on the ImageNet Large Scale Visual Recognition Challenge (ILSVRC2012).