-
Mutant and LEGO Dataset
The Mutant and LEGO dataset is a dynamic scene dataset. It contains 90% images for training and 10% images for evaluation. -
Tanks and Temples Advanced (T&T) Dataset
The Tanks and Temples Advanced (T&T) dataset is a benchmark dataset for image-based 3D reconstruction. It contains 90% images for training and 10% images for evaluation. -
Visual Wake Words (VWW) dataset
The Visual Wake Words (VWW) dataset consists of high-resolution images that include visual cues to 'wake-up' AI-powered home assistant devices. -
HSViT: Horizontally Scalable Vision Transformer
This paper introduces a horizontally scalable vision transformer (HSViT) scheme with a novel image-level feature embedding. The design of HSViT preserves the inductive bias from... -
ADE20k for semantic segmentation
The dataset used in this paper is ADE20k for semantic segmentation. -
SPL2018 and DsTok datasets for computer-generated image detection
The SPL2018 and DsTok datasets for computer-generated image detection -
Dual Stream Computer-Generated Image Detection Network Based on Channel Joint...
The proposed dual stream convolutional neural network for computer-generated image detection -
CNN Models
The dataset used in this paper is a large variety of popular CNN models, such as straight-forward, complicated-connected, and grouped architectures. -
ModelNet40, ModelNet10
The dataset used in the paper is ModelNet40 and ModelNet10, which are subsets of ShapeNet. -
ShapeNet, ModelNet40, ModelNet10
The dataset used in the paper is ShapeNet, a large-scale dataset of 3D models, and ModelNet40 and ModelNet10, which are subsets of ShapeNet. -
MobileDepth: Efficient Monocular Depth Prediction on Mobile Devices
Depth prediction is fundamental for many useful applications on computer vision and robotic systems. On mobile phones, the performance of some useful applications as augmented... -
CIFAR10, CIFAR100, ImageNet
MobileNets, MnasNets, EfficientNets, and ResNets -
Residual Networks
Residual Networks (ResNet) is composed of stacked entities referred to as residual blocks. A Residual Block of ResNet contains a module and an identity loop. -
Real-world Vehicle Point Cloud
The dataset used in this paper is a real-world vehicle point cloud collected from a real vehicle self-driving process. -
PoseAction: Action Recognition for Patients in the Ward using Deep Learning A...
Real-time intelligent detection and prediction of subjects' behavior particularly their movements or actions is critical in the ward. -
DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax
DepthP+P: A method for self-supervised monocular depth estimation using planar and parallax. -
3D Point Clouds
The dataset used in this paper is a collection of 3D point clouds.